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Abstract. The Gowers uniformity norms WfWijkf^Q^ of a function / : G — >■ C 
on a finite additive group G, together with the sUght variant ||/||[/fc([iv]) defined 
for functions on a discrete interval [A'^] := {1, . . . , A^}, are of importance in the 
modern theory of counting additive patterns (such as arithmetic progressions) 
inside large sets. Closely related to these norms are the Gowers-Host-Kra 
seminorms WfWjjk^^x) °^ ^ measurable function / : X — >■ C on a measure- 
preserving system X = {X, X, ii,T). Much recent effort has been devoted to 
the question of obtaining necessary and sufficient conditions for these Gowers 
norms to have non-trivial size (e.g. at least 77 for some small rj > 0), leading in 
particular to the inverse conjecture for the Gowers norms, and to the Ifost-Kra 
classification of characteristic factors for the Gowers-Host-Kra seminorms. 

In this paper we investigate the near-extremal (or "property testing") ver- 
sion of this question, when the Gowers norm or Gowers-Host-Kra seminorm 
of a function is almost as large as it can be subject to an L°° or LP bound 
on its magnitude. Our main results assert, roughly speaking, that this occurs 
if and only if / behaves like a polynomial phase, possibly localised to a sub- 
group of the domain; these results can be viewed as higher-order analogues 
of a classical result of Russo |29| and Fournier |10| . and are also related to 
the polynomiality testing results over finite fields of Blum-Luby-Rubinfeld [6] 
and Alon-Kaufman-Krivelevich-Litsyn-Ron [l]. We investigate the situation 
further for the norms, which are associated to 2-step nilsequences, and find 
that there is a threshold behaviour, in that non- trivial 2-step nilsequences (not 
associated with linear or quadratic phases) only emerge once the norm is 
at most 2~^/* of the L°° norm. 



1. Introduction 

1.1. The Gowers norms and Gowers-Host-Kra seminorms. The purpose of 
this paper is to investigate functions which have an exceptionally large Gowers 
norm U''{G) or Gowers-Host-Kra seminorm U''{X). To do this we first recall the 
definitions for these norms. 

We begin with the Gowers norms U^{G). These norms are usually defined on a 
finite abelian group G — (G, +), such as a cyclic group X/NTi or a finite field vector 
space Fp, but our results are most naturally stated in the more general setting of 
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"'^The Gowers uniformity norms can also be defined on some non-abelian groups, and in par- 
ticular in finite groups or nilpotent Lie groups; this is implicit in |23) . However, we will not 
consider such generalisations here. We also note that a slightly different Gowers-type seminorm 
was constructed on ^°°(Z) by Host and Kra [24) . which we will also not study directly here. 
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a locally compact abelian (LCA) group G — (G, +) equipped with a non-trivial 
Haar measure, /i, i.e. non-trivial translation-invariant Radon measure on the Borel 
cr-algebra B of G. In the case of a compact abelian group, we will require that the 
Haar measure is normalised^ so that /i(G) — 1, but of course we cannot require this 
in the non-compact case. In particular, if G is a finite abelian group, then the Haar 
measure is the normalised counting measure /i(-E) |-E|/|G|. 

For technical reasons it is convenient to assume that the group G is second count- 
able; this condition may almost certainly be removed, but we will not do so here 
as one has the second countability axiom in all known applications of the Cow- 
ers norms. Thus, in this paper, all locally compact abelian groups are implicitly 
understood to be second countable. In particular, their Borel cr-algebra B will be 
countably generated, and G will be metrisable (by the Urysohn metrisation theo- 
rem). 

As (G, S, /x) is a measure space, it comes with the usual spaces L'p{G) = Lp{ii) = 
LP{G,B,ii) for < p < oo. We also define L^iG) to be the subspace of L°°(G) 
consisting of functions that are compactly supported; this is a translation-invariant 
algebra, with the shift actioJl T'' : L^(G) ^ L^{G) defined for /i G G by the 
formula T^f{x) := f{x - h). 

Definition 1.1 (Gowers norms on a LCA group). [T4l[T5] Let G — (G, 4-, B, ji) he a 

locally compact abelian group with a non-trivial Haar measure fi, and let f € L'^{G) 
he a function. We define the Cowers uniformity norms ||/||;7k(G) = ll/llc/'«(/^) = 
ll/llt/'«(G.B,Ai) recursively for fc = 1, 2, . . . hy the formula 

(1) \\f\\uHG)-= I f d^ji 

Jo 

and 

(2) mv^HG) (/^ UT'^mtnG) Mh) 
for k > 1. 



1/2' 



It is possible to show that the Cowers uniformity norms are seminorms for any 
A; > 1; see e.g. [TS], [TB], or [33] ■ (In these references, the seminorm property is 
only established when the group G is finite or cyclic, but the proof easily extends 
to the general LCA case.) When k = 2, we see from ^ that we have 

ll/ll;72(G) = II/* /IIl2^(g) 

where f{x) := f{—x) and / * g{x) :— Jq f{y)g{x — y) d^{y). From Plancherel's 
theorem we thus have 

(3) WfWuHG) = 11/1^4(5) 



^In many texts, the shift T*^ f is defined as T^f{x) = f{x + h) instead of T'^f{x) = f{x - h). 
The two conventions lead to an equivalent definition of the Gowers norms; this convention is 
slightly more compatible with the ergodic theory conventions, and lead to the pleasant identity 
T^Ie = ^t'^e '■~ ^E+h for indicator functions 1^;. 
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where G — (G, B, /I) is the Pontryagin dual of G, and / is the Fourier transform. 
Because of this, the U'^(G) seminorm is in fact a norm on G, and by induction we 
see that the U^{G) seminorms are also norms for all fc > 2. 

The norm can also be written more explicitly as 
(4) 

1/2*= 



WfWuHG) ■■= \ f ^ ^ n c^'"^fi^ + hiuji + --- + hkLOk) d^lix)d^lihl)...d^lihk) 

where w = (wi, . . . ,ujk), I'^l := wi + . . . + w^, and C : z i— ^ z is the complex 
conjugation map. 

From Young's inequality (see Proposition [53] below, using the exponents (|40l) ) and 
induction we obtain the inequality 

(5) WfWw'iG) < \\f\\LPk{G) 

for all fc > 1 and / G L'^(G), where pk is the critical exponent 

2'' 

associated to k, thus for instance 

Pi = 1; P2 = 4/3; P3 = 2; P4 = 16/5; . . . 

This exponent is natural from the perspective of dimensional analysis, and in par- 
ticular with respect to the operation of dilating the Haar measure /x to c/i for some 
scalar c > 0. Indeed, a short computation shows that 

\\f\\u''{G,B,cf,) = C^^'''" 11/11 [/HG^B.m) ■ 

As a consequence, as long as the Haar measure is not fixed, one cannot replace 
the Pk exponent in ([5]) by any other exponent. On the other hand, in the compact 
case, the Haar measure is normalised so that /i(G) = 1, and then ([5]) also holds for 
higher exponents than pk by Holder's inequality, and in particular 

(7) WfWuHG) < ll/IU~(G). 



From dS]), we may extend the U^{G) seminorm from L^{G) to all of LP''{G) by 
continuity. As the Fourier transform is an injection from L'^^^iG) to L'^(G), we see 
that the U^{G) norm remains a norm on L^/^{G), and by induction the [/'^(G) norm 
remains a norm on LP''{G) for fc > 2. By continuity, we see that the formulae ([l}, 
^ continue to be valid for / £ L^'' (G) (in particular, the integrands are absolutely 
integrable) . 

It is also convenient in applications to define the Gowers norms over the discrete 
interval [N] := {!,..., N}. Given a function / : [N] C and a positive integer fc, 
we define the Gowers norm ||/||c/fc([Ar]) of / on [N] by the formula 

(8) ll/lic/'»([Ar]) ||/|lc/fc(z/Arz)/l|l[JV]llc/''-(z/Jvz) 

where N is any integer with N > 2^N , [N] is embedded into the cyclic group Ti/NTi 
in the obvious manner, and / : 7i/ NTi — C is the extension of / by zero outside of 
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[N]. It is easy to sec that this definition does not depend on the precise choice of 
N. Clearly we still have the analogue of ([7]): 

(9) ll/ll;7'=([Ar]) < ll/l|L~([Af])- 

The situation with ([5]) however appears to be more complicated - there seems to 
be a loss of a multiplicative constant in the [N] setting - and we will not study it 
here. 

Next, we turn to the Gowers-Host-Kra seminorms arising from an ergodic measure- 
preserving system (X, A", /i,r), by which we mean a probability space {X^X,^) 
together with an invertible, bimeasurable, measure-preserving shift T : X ^ X 
which acts ergodically, so that the invariant space L^(X, A", /i)-^ of L^{X,X,ii) 
consists only of the constants (up to almost everywhere equivalence). We abbreviate 
"ergodic measure-preserving system" as "ergodic system" for short. 

Definition 1.2 (Gowers-Host-Kra seminorms). ^23] Let {X, X , ^,T) be an ergodic 
system, and let f € L°°{X,X,fi). We define the Gowers-Host-Kra seminorms 
WfWvix) = ll/llc/'=(-^,A',M,T) recursively for fc 1, 2, . . . by the formula 



l/ll 



f{x) dfi{x) 



X 



(10) 
and 

(11) \\f\\u^+Hx) ■■= ^Imi^ (^kelH]\\{T''f)7\\u^x) 

for k>l, where f := / o . 



1/2'=+ 



The existence of these seminorms, and the fact that these are indeed seminorms, is 
established in [23] . The definition of the norms used there appears to be somewhat 
different from that presented here, but the two definitions are equivalent, as can be 
seen from an application of the ergodic theorem; see e.g. [5] Appendix A]. 

Remark 1.1. It is possible to define the Gowers-Host-Kra seminorms on non-ergodic 
spaces as well, but in such cases the definition of the U^{X) norm must be replaced 
with 

WfWuHx) := Jini^ (e„^[h] I^T''f{x)7{x) dfi{x) 

One can relate the non-ergodic norms with the ergodic norms via ergodic decom- 
position as follows. Let X^ := {X , X^ , ii^ , id) be the invariant factor of X, thus 
X^ := {E G X : TE = E} and fi^ is the restriction of to X, and one has an 
ergodic decompositior0 /i = J-^j- /ij, d^^{y) for some T-invariant ergodic measures 
for /x-^-almost every y g X"^ that are depending in a measurable fashion on the 
parameter y G X'^ . Then one can show that 

1/2^= 

\\n\uHX)=[j^J^\\uHx,x,,,)d^F{y)^ . 



■'Strictly speaking, to construct this decomposition we need to assume that the system is 
standard Borel, in that X has the structure of a Polish space with X as its Borel cr-algebra. 
In practice, we will be able to reduce to the standard Borel case in most applications, so this 
technicality will not concern us. See [11] for further discussion. 
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From this inequality, it is possible to extend some of the results given here for er- 
godic systems to the non-ergodic setting, although the statements of the hypotheses 
and conclusions become significantly messier. Because of this, we will not discuss 
the non-ergodic case here. 

Example 1.2. The Gowers uniformity norm ||/||;7'!(z/Arz) on a cyclic group Z/A^Z 
can be viewed as a special case of the Gowers-Host-Kra seminorms, in which X is 
equal to Z/NZ with the discrete cr-algebra X := 2^^/^^, the normalised counting 
measure fJ,(E) := \E\/N, and the standard shift Tx := x + 1 (which is clearly er- 
godic); the equivalence of the two (semi)norms can be verified by an easy induction. 
More generally, if G is a compact abelian group, with an ergodic shift x t-^ x + a 
and normalised Haar measure, then an easy induction shows that the Gowers norm 
U^{G) and the Gowers-Host-Kra seminorm U'^{G) coincide. 

The analogue of Young's inequality for ergodic systems (see Proposition 17. 1|) gives 
the analogues of (O, namely that 

(12) WfWuHx) < Wfh'HX) 
and thus 

(13) < II/IIl»(x) 

for all / G L^{X). Because of this, these seminorms can be extended by continuity 
to L^'' (X) much as before. In contrast to the group case, though, these seminorms 
can vanish. Indeed, this occurs precisely when / is orthogonal to the characteristic 
factor Zk-i of the ?7'° norms, which is an important object in the theory of multiple 
recurrence; see [23] • 

Remark 1.3. The (semi)norms U''{G), U'^{[N]), U^{X) are so similar that it is 
tempting to create a very general definition of Gowers-type norms that encom- 
passes all three concepts at once. It seems that the machinery of nilspaces (see [25] , 
[8] ; the concept is a variant of the notion of a cubic complex from topology) would 
be particularly suitable for this purpose, as would a reformulation in terms of non- 
standard analysis (or by closely related devices, such as ultraproducts) . However, 
we will not attempt to perform such a unification here, thus creating three parallel 
(but closely analogous) streams of results instead. One reason for this is that there 
are some non-trivial technical differences between the proofs of the results in the 
three categories, particularly with regards to the U^{[N]) category, where the fail- 
ure of \N] to be perfectly closed under addition leads to some complications when 
one attempts to adapt arguments that were successful in the other two categories. 

1.2. Near- maximal values of the Gowers norms and Gowers-Host-Kra 
seminorms. The first main results of this paper address the question of when the 
inequalities ([7]), ([5]), ([9]), (fT2l) . ([13]) are satisfied with equality or near-equality. For 
the L°°-based inequalities (O, (O, (dU, the answer to this question is given in 
terms of the notion of a polynomial. We first define this concept on groups: 

Definition 1.3 (Polynomials on groups). Let A: > — 1 be an integer. A measurable 
map P : G ^ H between two LCA groups G, H is called a polynomial map of 
degree < k if one has 

Ah,...Ah,^,Pix)=0 
for almost every x,hi, . . . , hk+i £ G, where AhP{x) :— P{x — /i) — P{x). 
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Example 1.4. The only polynomial map of degree < —1 is the zero map. The only 
polynomials of degree < are the constant maps. The only polynomials of degree 
< 1 are afHne homomorphisms, i.e. sums of constants and group homomorphisms. 

An easy induction (using the strong continuity properties of the translation action 
T'* : P I— > P{- — h)) shows that a measurable polynomial is equal almost every- 
where to a continuous polynomial. Of course, in the case of discrete groups G, 
and in particular finite groups, all polynomials are automatically measurable and 
continuous. 

In the case of the integers G = Z, another easy induction shows that a map P : 
Z — > is a polynomial of degree < fc if and only if it takes the form 

for some coefficients cq, . . . ,Ck e H, where 

/ n\ n{n — 1) . . . (n — i + 1) 
[ij i\ 

are the (generalised) binomial coefficients. In the case when the target group H 
is divisible, which means that for every x € H and positive integer m there exists 
y G H such that my = x, we can rewrite / in the form 

k 
i=0 

for some other coefficients Cq, . . . , cj, € H. We note that any connected compact 
abelian Lie group is divisible, and in particular the unit circle R/Z is divisible. 

In the case when G is a Euclidean space R*^, and H is the imit circle R/Z, then it 
is easy to see (by approximating the Euclidean space R'* by a lattice eZ'' and then 
taking limits as £ — > 0) that a map P : R"^ — >■ R/Z is polynomial of degree < fc if 
and only if it takes the form 

f{xi,...,Xd) := ^ Ci^^...,i^xY ...x'^ modi 

ii,...,id>0:ii + ...+id<k 

for some coefficients Ci^^...^i^ G R. 

We can similarly define the notion of a polynomial on ergodic systems: 

Definition 1.4 (Polynomials on systems). Let fc > — 1 fee an integer. A measurable 
map f : X ^ H between an ergodic system X ~ {X, X,fj,,T) and a LCA group H 
is called a polynomial map of degree < fc if one has 

AH,...Ah,^,f{x)=0 

for fi-almost every x £ X and every hi, ... , hk+i G Z, where Ah '.= — 1. 

Example 1.5. Let / S L°°{X) be a non-trivial eigenfunction of X, thus / is not 
identically zero and Tf = Xf for some complex number A (which one can show 
to necessarily have magnitude 1). Then one can show that f ~ c ■ e{P) for some 
complex constant c and some polynomial P : X ^ R/Z of degree < 1, where 
e{x) := e^'^'^. Conversely, every function of this form is an eigenfunction of X. 
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Functions of the form c • e{P), where P : X — > R/Z is of degree < k, are known 
as generalised eigenf unctions of order < k. For instance, if we take the skew shift 
system X := (R/Z)^ with the standard measure and the shift T{x, y) := {x + 
a^y -\- x) for a fixed irrational a, then the function f{x,y) := e{y) is a generahsed 
eigenfunction of order < 2; note that Ahf{x,y) = e{^hx — (^'')'^) is an ordinary 
eigenfunction for every /i S Z. 

The relevance of polynomials to the Gowers norms can be seen as follows. If fc > 1, 
G is a LCA group, and P : G ^ YL/Ti is a polynomial of degree < fc — 1, then we 
have the invariance 

(14) ll/-e(P)||,;.(G) = ||/||c/HG) 
for every / € L^''{G). Similarly we have 

(15) 11/ • e(P)||c/''([w]) ^ ll/llc/nW) 

for every / € L'p*'{[N]) and polynomial P : Z — R/Z of degree < fc — 1, and 

(16) \\f-e{P)\\uHx) = \\f\\uHx) 

for every ergodic system X, every / S L^'' (X), and every polynomial P : X — > R/Z 
of degree < fc — 1. In particular, we see that equality holds in ([7]), or (|12l) when 
/ takes the form f = c ■ e(P) for a polynomial P (of the suitable type) of degree 
< fc - 1. 

Our first main results are that these are in fact the only cases in which equality 
in ([7]), (l9|), ([T2l) occurs, and they also control all the cases in which near-equality 
occurs. To formulate this properly, it is convenient to introduce the following 
notation. Given an asymptotic parameter £ > and some additional quanti- 
ties oi, . . . , a„i, A, we use Oe_yO;ai,...,a„ (^) to denote any expression bounded in 
magnitude by Cai,...,a„ (e)^, where Cai,...,a^{e) is an expression depending only on 
oi , . . . , Qm , £ that goes to zero as e — for fixed ai , . . . , Um ■ We will use the ex- 
pression 0£->.o;/c(^) particularly often, and will abbreviate this expression as o(A). 
In most situations in this paper, A will simply be equal to 1. 

Theorem 1.1 {L°° near-extremisers on compact abelian groups). Let k > 1 be 

an integer, let G be a compact abelian group, and let f € L°°{G) be such that 
||/IU-(G) < 1- Lete>Q. 

(1) (Extremisers) One has ||/||;7'«(G) = 1 */ only if f = e(P) almost every- 
where for some polynomial P : G — > R/Z o/ degree < fc — 1. 

(2) (Near-extremisers) If \\f\\u''(G) ^ 1 ~ then there exists a polynomial 
P -.G-^ R/Z of degree < fc - 1 such that \\f - e{P)\\L^c) = o{l). 

Theorem 1.2 {L°° near-extremisers on intervals). Let k > 1 be an integer, and let 
N > I be a sufficiently large integer depending on fc. Let f € L°°([N]) be such that 
||/||l-([7V]) < 1- Let e>0. 

(1) (Extremisers) One has \\f\\u''{lN]) = 1 */ '^i^d only if f = e(P) for some 
polynomial P : Z — > R/Z of degree < fc — 1. 

(2) (Near-extremisers) If \\f\\u''{[N]) ^ 1 " £j then there exists a polynomial 
P : Z ^ R/Z of degree < fc — 1 such that \\f - e(P)||^i([jv]) — o(l). 
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Theorem 1.3 near-extremisers on ergodic systems). Let k > 1 be an integer, 
let X he an ergodic measure- preserving system, and let f G L^{X) be such that 
||/||l-(X) < 1- Lete>0. 

(1) (Extremisers) One has |l/|l;7it(x) = 1 if and only if f = e{P) almost every- 
where for some polynomial P : X — > R/Z of degree < k — 1. 

(2) (Near-extremisers) If \\f\\u''{x) ^ 1 ^ £; then there exists a polynomial 
P : X ^ R/Z of degree <k-l such that \\f - e(P)||ii(x) = o(l). 

We will prove these theorems in Sections [21 [3l |4] respectively. We remark that 
Theorem II. ![ which can be interpreted as an assertion that the property of being 
a polynomial is locally testable, was essentially established in the case when G is a 
vector space over a finite field in [T] (see also [S] for the k = 2 case, or [33], [3S] for 
a more explicit formulation of this result). 

As one might expect, the proofs of the three results are very similar to each other, 
and proceed by an induction on k; they are the easiest of all the results in this 
paper to prove. The main difficulty is a "cohomological" one, namely to show that 
a certain "2-cocycle" arising from applying the induction hypothesis to "derivatives" 
(T^f)f of / is in fact a "2-coboundary" . However, when e is small enough, this 
2-cocycle takes on small values, and one can obtain this 2-coboundary property by a 
routine averaging argument. (The situation is more delicate on [N], as the domain 
is no longer shift-invariant, but a more sophisticated version of this argument still 
applies.) We will not actually use advanced cohomological tools in our arguments, 
though, and the reader may ignore the references to cohomological notation in this 
paper if desired. 

Remark 1.6. Our arguments give an effective bound on the o(l) decay rates, which 
are of polynomial nature on e. However, we will not attempt to optimise or make 
explicit these rates here. 

Next, we consider extreme or near-extreme cases of the critical inequalities (O, 
(IT31) . which compare the Gowers norms U'^ to their Lebesgue counterparts L^''. 
From the polynomial phase invariance (fT4)) . (IT6l) we expect polynomial phases e{P) 
to continue to play a prominent role. However, due to the critical nature of these 
inequalities, another object now also comes into play, namely the cosets. 

Definition 1.5 (Cosets in a LCA group). Let G = (G, -t-, /B, /i) be an LCA group. 

A coset in G is any set of the form H = xq -\- Hq, where G G and Hq < G is 
a compact open subgroup of G (which implies in particular that < Ijl{Hq) < oo). 
We define the normalisation of the coset H to be the compact abelian group Hq with 
the normalised Haar measure j^jj^f- [hq- 

Definition 1.6 (Cosets in an ergodic system). Let X = {X, X , fi,T) be an ergodic 
system. A coset in X is any measurable set H in X with the property that there 
exists an integer m > 1 such that T^^H = H up to pi-null sets, and that the 
sets H,TH, . . . ,T™~^H partition X up to jjL-null sets. (In particular, this forces 
n{H) = 1/m.) We callm the index ofH. We define the normalisation of the coset 
H to be the measure-preserving system H = {II,X [h, J[(}T)1^ IhiT™). 
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Example 1.7. li G — X — Z/NZ with the usual shift, then the above two notions 
of coset coincide with each other, and with the famihar notion of a coset from group 
theory. More generally, a coset in an ergodic system X is the same concept as a 
group-theoretic coset of the Kronecker factor Z\ of X; see Remark 18. II 

Example 1.8. An ergodic system has no nontrivial cosets if and only if is totally 
ergodic, in that is ergodic for every non-zero h. 

We observe that the normalisation of a coset H in an ergodic measure-preserving 
system X is again ergodic. Indeed, if / G L^{H) is T^-invariant, then / + Tf + 
. . . + T™~^f is T-invariant in L'^{G), where we extend / by zero to all of G, and 
the claim follows. 

The relevance of cosets arises from the following observation. If G is an LCA 
group with a coset H = xq + Hq and normalisation Hq, and / e L'P''{Hq), then 
straightforward induction shows that the function / g LP''{G) defined by f{x) :— 
^{H)^^^P^lH{x)f{x — a;o) obeys the scaling relationships 

ll/llc/''(G) = ll/ll;7'=(//) 

and 

ll/llLPfc(G) = ll/llLJ'fc(H) 

where of course we use the normalised measure to compute the norms on the right- 
hand side. Similarly, if X is an ergodic measure-preserving system with a coset iJ, 
and / G LP''{H), then another straightforward induction shows that the function 
/ G L^'' (G) defined by f{x) := fi{H)^^^P'' lH{x)f{x) obeys the scaling relationships 

\\f\\u''(X) = \\f\\u>'(H) 

and 

\\f\\L"k(X) = \\f\\LPk(H)- 

Because of this, we see that we have a more general class of extremisers for ([5|), 
p3)) . Indeed, ii H = xq + Hq is a coset of a LCA group G and P : Hq — > R/Z 
is a polynomial of degree < k — 1, then from the above discussion we see that 
the function f{x) := clH{x)e{P{x — xq)), for any constant c G C, obeys ^ with 
equality. Similarly, if is a coset of an ergodic system X, and P : — > R/Z is 
a polynomial of degree < fc — 1, then the function f{x) :— clH{x)e(P{x)) for any 
constant c G C obeys ((T3| with equality. 

Our next two main results assert that these are the essentially only means to create 
extremals or near-extremals for ^ , (fT3|) , once one avoids the degenerate case k = 1 
(for which there are clearly plenty of such near-extremals): 

Theorem 1.4 (L^''= near-extremisers on LCA groups). Let k >2 be an integer, let 
G be a locally compact abelian group, and let f G L^'' (G) be such that ||/|| LPfc(G) ^ 1- 
Let £ > 0. 

(1) (Extremisers) One has \\f\\u''{G} — 1 */ ^'^'^ only if 

fix) = fi{H)-^^'PnH{x)e(P{x - xo)) 

almost everywhere for some coset H — Xq + Hq and polynomial P : Hq — > 
R/Z of degree < k — 1. 
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(2) (Near-extremisers) If ||/||;7fc(G) 1 ^ then there exists a coset H — 
Xq + Hq and a polynomial P : Hq — > R/Z of degree < fc — 1 such that 

11/ - fi{H)-'/PnHe{Pi- - xo))|U..(G) = 

Theorem 1.5 {L^'' near-extremisers on ergodic systems). Let k > 2 be an integer, 
let X be an ergodic system, and let f G LP''{X) be such that \\f\\LPk{x) ^ 1- Let 
e > 0. 

(1) ( Extremis ers) One has ||/||;7*!(x) — 1 if and only if f — ii{H)^^^P^lHe{P) 
almost everywhere for some coset H and polynomial P : H R/Z of 
degree < k — 1. 

(2) (Near-extremisers) If \\f\\u''{X) ^ 1 ~ £i then there exists a coset H and a 
polynomial P : H ^ R/Z of degree < k ~ I such that 

\\f - ^iiH)-'/PnHe{P)U..^X) ^ 0(1). 

Wc prove these theorems in Sections [6l [8] respectively. The main idea is to use a 
classification of the near-extremisers of Young's inequality, due to Fournier [lOj . to 
reduce matters to the theory discussed earlier. (Indeed, the paper [10] already 
implicitly contains the k = 2 version of Theorem 11.41 ) To prove Theorem 11.51 we 
also exploit the theory of the Kronecker factor of an ergodic system. 

1.3. The Euclidean case. Theorem 11.41 has a corollary, which was essentially 
observed by Fournier [10] (and by Russo [29] when fc = 2): if a locally compact 
group G has no compact open subgroups, then one has 

ll/llfyHG) < Ck,G\\f\\L''k{G) 

for some constant Ck.a < 1 that is bounded away from 1 uniformly in G. In the 
case that G is a Euclidean space R" (with the usual Lebesgue measure, of course), 
we can compute the optimal value of Ck.o precisely: 

Theorem 1.6 (Sharp critical inequality for [/''(R")). Let k,n > 1. Then for any 
f e LP-iK^), one has 

(17) 11/11 t/HR") <CINI/IU''KR") 
where Ck is the constant 

(18) Cfe:=2'=/2V(fc + l)('=+i)/2'=+\ 

This constant is best possible, and when k > 2, equality is attained if and only if f 
takes the form 

(19) fix) = ce~(="~^")-*-^(^-^«)e(0(a;)) 

for some c £ C, xo G R", a positive- definite n x n matrix M , and a polynomial 
: R" ^ R/Z of degree <k-l. 



LARGE VALUES OF THE GOWERS-HOST-KRA SEMINORMS 
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The first few values of Ck are 

Ci 

C2 

C4 

We prove Theorem ll.6l in Section[9l Our main tool is the sharp Young inequality due 
to Beckner [3] and Brascamp and Lieb [7^ , and its converse. It is quite likely that one 
could also establish near-extremiser results analogous to those appearing previously 
in this introduction, but we will not attempt to do so here. (A starting point would 
be to first establish near-extremiser results for the sharp Young inequality, which 
could perhaps be deduced using the machinery from [5], [3].) 

1.4. Threshold behaviour for the norm. It is natural to ask how small the 
quantity 1 — e appearing in results such as Theorem 11.41 or Theorem 11.51 can be. 
For fc = 1, 2 we can lower 1 — e all the way to zero: 

Theorem 1.7 (Inverse theorem for k — 1,2). Let fc = 1, 2 and let e > 0. 

• If G is a compact abelian group (with normalised Haar measure) and f £ 
L°°{G) is such that ||/||l°°(g) ^ 1 '"^'^ ll/ll;7''(G) — then there exists a 
polynomial P : G ^ R/Z of degree at most fc— 1 such that |(/, e(P))/^2((3) | > 
c{k,e), where c{k,e) > is a quantity that depends only on k and e. 

• If N is a positive integer and f e L°°([Af]) is such that ||/||L°°([Af]) ^ 1 '^i^d 
\\f\\u''{[N]) ^ then there exists a polynomial P : [N] — > R/Z of degree 
at most fc — 1 such that | (/, e(P))^2([7v]) I ^ c(fc,£), where c{k,e) > is a 
quantity that depends only on fc and e. 

• If X is an ergodic system and f G LP''(\) is such that ||/|| LPfc(G) "Si 1 md 
ll/ll(7'=(G) > 0; then there exists a polynomial P : X — > R/Z of degree at 
most fc — 1 such that \ (/, e(P)) l'^(x) I > 0. 

Proof. The case fc = 1 is easily verified by inspection, so we establish the fc = 2 
case only. In the group case, we can use ([3]) and Plancherel's theorem to conclude 
that 

ll/ll.^(G) < 1 

and ^ 

ll/ll£*(G) - ^ 

which implies that there exists ^ G G such that \f{£,)\ ^ ^'^j ^^^d the claim follows. 

The interval case [N] follows easily from the group case with G = Z/NZ for a 
suitable N. In the ergodic system case, it is possible to perform a similar argument 
using the spectral decomposition of the shift T, but we will instead use the (closely 
related) theory of the Kronecker factor from Host and Kra [23j . It suffices to show 
that if / e L^/^{X) is orthogonal to all eigenfunctions of X, then it has a U'^{G) 



2I/2 

22/4 ' 
22/4 

23/8 
44/ 16 
24/16 
55/32 



0.9367 



0.9248. 



0.9170 
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norm of zero. Let Z\ be the sub-cr-algebra of X generated by the eigenfunctions of 
X (or equivalently, by the pure point spectrum of T) , then / is orthogonal to Zi . 
The claim then followfQ from [531 Lemma 4.3]. □ 

However, the situation changes when k ~ S: the parameter e in Theorem 1 1 . 71 can 
no longer be lowered all the way to zero. In the group case (and specifically, for 
cyclic groups G = Z/NZ with TV large) and in the case of the interval [N], this was 
observed by Gowers [14] ; the analogous observation for ergodic systems is implicit 
in the work of Furstenberg and Weiss [12], [13] (in the closely related context of 
multiple recurrence), and more explicitly in the work of Host and Kra [23j . On the 
other hand. Theorem 1 1 . 71 for arbitrarily small s can be recovered in the fc = 3 case 
for some groups G, such as vector spaces G — F'^ over a fixed characteristic p; see 

m, m- 

Our final main results are to compute the precise threshold for which Theorem 1 1.71 
continues to hold in the fc = 3 case, provided that the group G is cyclic, or that the 
system X is totally ergodic. More precisely, we have 

Theorem 1.8 (Sharp U'^ threshold for systems). If X is a totally ergodic system 
(thus is ergodic for every non-zero h) and f e L?'{X) is such that \\f\\L'^{x) ^ 1 
||/||;73(x) > 2""'^/®, then there exists a polynomial P : X R/Z of degree 
at most 2 such that \{f ,e{P)) ]:^-2(^x)\ > 0- Furthermore, the constant 2^^!^ is best 
possible. 

Theorem 1.9 (U^ threshold for intervals). If N is an integer and f E L°°{[N]) is 
such that \\f\\L'^([N]) ^ 1 \\f\\u^([N]) ^ 2~^/^ + 77 for some 7] > 0, then there 
exists a polynomial P : [N] — R/Z of degree at most 2 such that \ (/, e(P))i2Q^]) | > 
c{ri) for some c{r]) > 0. 

Theorem 1.10 {U^ threshold for cyclic groups). // N is an integer and f E 
L°°(Z/iVZ) is such that ||/||l~(z/7VZ) < 1 and ||/||(73(z/Arz) > 2"^/® + 77 for some 
rj > 0, then there exists a positive integer 1 < 9 < G{r]) and a polynomial P : 
Z/qNZ — > R/Z of degree at most 2 such that \{f, e{P)) ]^2(^2,/qNZ) \ > c{rj) for some 
C{ri),c{r]) > 0, where we lift f from Z/NZ up to Z/qNZ in the obvious manner. 

We prove these theorems in Sections [TOl [Tl] [T2] respectivelv. The constant 2^^/^ is, 
not coincidentally, the constant C3 that appears in Theorem ll.6[ and indeed we will 
use Theorem 11.61 as a key tool in establishing the above results. We will also rely 
crucially on the inverse theorems of Host and Kra ,23^ (in the ergodic case) and of 
Green and Tao [17 (in the cyclic group and interval cases), and on the structural 
theory of 2-step nilmanifolds (as developed for instance in [2^). We also crucially 
take advantage of the fact that the critical exponent p2 is equal to 2, allowing us 
to exploit Plancherel's theorem at a key juncture to simplify the expressions being 
computed. The relevance of nilsystems for Gowers-type norms was first explicitly 
noted in [23], although nilsystems had also been implicitly linked to the closely 
related problem of multiple recurrence in [9], |13] . [22] . 



Strictly speaking, the lemma cited only applies when / lies in L°°(X), but it can be easily 
extended to / in L'^^^{X) by a limiting argument using I I12II . 
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The total ergodicity hypothesis in Theorem ll.Sl is needed to rule out a certain type 
of dynamical system that is not obviously modeled by the Euclidean case from 
Theorem 11.61 and it is possible that the l"^!^ threshold may change if one no 
longer assumes total ergodicity; see Remark 110.21 We will deduce Theorem 11.101 
from Theorem 11.91 The need to lift up to a finite extension Z/gA^Z of Z/iVZ is 
technical, and is due to functions / : Z/A^Z — ^ C such as 

/(n) e(T?lqN) 

for n = 0, . . . , - 1, which has a large U^{Z/NZ) norm, but has low correlation 
with e{P) for any quadratic P : Ti/NTi R/Z with period N, if TV is large 
compared with q. It may be possible that the threshold 2^^/* is high enough to 
exclude such examples, but we have not pursued this matter here. We do not know 
if the constant 2~^/^ is best possible for Theorem ll.lOl there is a technical difficulty 
when trying to embed the sharp counterexamples from an interval [N] into a cyclic 
group 7i/N7i with no loss in constants. We also do not know if the constant 2~^/^ 
is best possible for Theorem 11.91 due to the issue of a possible gap between the 
L°°([iV]) and L'^{[N]) norms. 

We do not know what the analogues of Theorems ll.8[|1.9 l ll.l0l are for higher step. 
The inverse theorem of Host-Kra and Green- Tao extend to higher step (see [23] . 
[2T] respectively), which essentially reduces matters to checking that the claimed 
assertions are true for nilsystems (in the ergodic case) or for nilsequences (in the 
interval and cyclic group cases). However, the structure of nilmanifolds of step 
3 or higher is significantly more complicated than in the 2-step case, but more 
importantly, the critical exponent pk is no longer equal to 2, and certain exact 
identities arising from the Plancherel theorem are no longer available to simplify 
the problem. 

The authors are indebted to Tim Austin for posing these questions, and the anony- 
mous referee for many useful comments and corrections. 

2. L°° NEAR-EXTREMISERS ON GROUPS 

We now prove Theorem I l.li which is the easiest of the near-extremiser results. We 
will just prove the claim for near-extremals with e > 0; the claim for extremals 
is simpler and can in any event be obtained from the e > case by a limiting 
argument (or by adapting the proof). 

We will need a simple lemma from Lemma C.l]: 

Lemma 2.1 (Separation lemma). Let k > 1 be an integer, and let P : G —>■ R/Z 

be a polynomial of degree < k such that ||e(P) — 1|| l2(g) < 2"'^"'"^/^. Then P is 
constant. 

Proof. When k — 1 the claim is immediate from Fourier analysis and Pythagoras' 
theorem (since e(P) is orthogonal to 1 when P is a non-constant linear polynomial), 
so suppose that k > I and the claim has already been proven for k — 1. If 

||e(P)~l|U.(G)<2-'=+i/2^ 
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then for any h € G we have 

lle(T'^P) - < 2-'=+i/2, 

and thus by the triangle inequahty 

||e(r''P)-e(P)|U.(G) <2-''^+3/2, 

which we rearrange as 

||e(A„P)-l|U.(G) <2-(^-i)+i/2. 

By induction hypothesis, this imphes that A^P is constant for every h, thus P is 
of degree < 1, and the claim then follows from the k = 1 case. □ 



One consequence of this lemma (and the second countability of G, which implies 
that L'^{G) is separable) is that there are at most countably many polynomials P 
of a given degree, up to constants and almost everywhere equivalence. 



We now return to the proof of Theorem ll.il We begin with the easy case k = 1. 
Here we have ||/||l°^(g) < 1 and 



G 



> 1-e. 



By rotating / by a phase (which does not affect either the hypothesis or conclusion 
of the theorem) we may assume that 



Re / f dn>l-e 
Jg 

and thus 

/ (1-Re/) d^l^o{l). 
Jg 

By Markov's inequality, we thus have 1 — Re/(a;) — o(l) for 1 — o(l) of the values 
of a: € G, which implies that ||/— 1||li(G) = o(l), and the claim follows. For future 
reference we remark that the above argument did not use the group structure of G, 
and thus also establishes the fc = 1 case of Theorem 11.21 and Theorem 11.31 



Now assume inductively that k > 2, and that Theorem 1 1.1 1 has already been proven 
for fc — 1. It will be convenient to relax the hypothesis ||/||[/'«(g) — 1 ~ e to 
||/||c/''(G) > 1 — o(l), as this will give us the freedom to perturb / by o(l) in 
the U'*'{G) (and hence U^{G)) norm, subject of course to the constraint that the 
L°°{G) norm of / remains bounded above by 1. 

It will be convenient to adopt the following notation: a measurable subset of G will 
be called very dense if its measure is 1 — o(l). 

We may assume that e is small, as the claim is vacuous for large e. From ([5]) we 
have 

II/IIl^.(G) = 1-o(1) 

and thus (since ||/||lo=(g) ^ l)i we have |/| 1 — o(l) for a very dense set of points 
X in G. Thus, by modifying / by o(l) in U^''{G) norm, we may assume without 
loss of generality that |/| = 1 everywhere. 
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Next, we use ([2]) to observe that 

ll(rV)7ll?,-MG) d^,ih) = 1-0(1). 

Since 

ll(TV)7llc/-nG)<ll(7^'7)7llL~(G) = l, 

we conclude from Markov's inequality that 

ll(TV)7llc/-nG)-l-0(l) 

for all /i in a very dense subset of G. Applying the induction hypothesis, we see 
that there exists a very dense subset ^ of G such that for all h £ A, there exists a 
polynomial P/i : G — > R/Z of degree < k — 2 such that 

||(TV)7-e(P,)|Ui(G) 

We now pass from the very dense set A to all of G as follows. Let h E G. As A 
and h — A both have density greater than 1/2 in G, they must intersect, thus we 
can write h = a -\- b for some a,b £ A. By hypothesis, one has 

||(r«7)7-e(Pa)IUi(G) = o(i) 

and 

ll(rV)7-e(n)|Ui(G)=o(l). 
Since |/| = 1, we conclude that 

l|TV-e(Pa)/|lLi(G)-o(l) 

and 

||r"+''/-e(r'^P6)TVIlLi(G) = 

From the triangle inequality, we conclude that 

l|T'7-e(™ + Pa)/||LMG)=o(l). 

The expression T'^Pb + Pa is a polynomial of degree < fc — 2. We have thus shown 
that for every h E G, there exists a polynomial Qh ■ G R/Z of degree < fc — 2 
such that 

l|rV-e(g^)/IUi(G) = 0(1). 

Fix a Qh for each h E G. We may select Qh in a Borel measurable manner, because 
there are only countably many polynomials up to constants; see [3 Appendix C]. 

If h, h' E G, then by repeating the previous argument we see that 

WT'^+'^'f ~ e(T"g„, + Qh)f\\LHG) = o{l). 

On the other hand, 

\\T'^+'''f-e{Qh+h')f\\L^(G)=o{l). 
From the triangle inequality (and the fact that |/| = 1) we thus have 

\\e{Qh+h' - T'^Qh' - Qh) - 1\\lhg) = o{l) 
which (by the boundedness of e{Qh+h' — T^Qh' — Qh) — 1) implies that 
(20) \\e{Qh+h' - T'^Qh' - Qh) - = o(l). 



i 
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For e small enough, we thus conclude from Lemma [2.11 that Qh+h' — T^Qh' — Qh 
is constant for every h, h' e G, thus 

(21) Qh+h' - T'^Qh' -Qh = Ch,h' mod 1 

for some Ch.h' € R- From (^0]) we can assume that Ch,h' — o(l). As Qh depends in 
a Borel measurable fashion on h, we see that Ch.h' depends in a Borel measurable 
fashion on h,h' . 

From ([2T|l we obtain the equation 

ChM' + Ch+h',h" = Chji'+h" + Ch',h" mod 1 

for all h, h' , h" £ G] since the Ch.k' — o(l), we can remove the mod 1 projection 
here and conclude that 

(22) Ch.h' + Ch+h'M" = Ch,h'+h" + Ch',h" 

for aU h,h',h" e G. 

If we average ((23|l in h" , we obtain a relation of the form 

(23) ChM' =b{h) + b{h')-b{h + h') 

where b{h) :— ^^Ch.h" d^{h"). Since Ch^" — o(l), we have b{h) ~ o(l) also. 

Remark 2.1. In the language of cohomology for dynamical systems, the equation 
([22|) asserts that the map (/i, h') H> Ch^' is a 2-cocycle, and the formula (I23|) asserts 
that this map is in fact a 2-coboundary. Thus we have shown that all sufhciently 
small R/Z-valucd 2-cocycles are 2-coboundaries. Once one moves away from the 
near-extremal regime, then genuinely non-trivial cocycles begin appearing, which 
ultimately forces one to replace polynomials by nilsequences and related objects; 
see [23] . 

Now set Qh ■= Qh + b{h). Then Qh is still a polynomial of degree < k — 2, and we 
still have 

(24) r'7-e(Q/.)/l|Li(G) =0(1) 

for all h E G. From (PT|) and (|23p we have the cocycle equation 

Qh+h' = T^Qh' + Qh 

for all h, h' E G. Specialising this to 0, we see that 

Qh'i-h) = cj)i-h-h')-(j){-h) 
for all h, h' E G, where (f>{x) := Q-x{0)', thus 

Qh := A,i(/). 

As <j) is Borel measurable and all derivatives Ah<j> of <j> are polynomials of degree 
< fc — 2, is of degree < A: — 1. We can then rewrite (|24|) as 

rv-/iiLHG) = o(i) 

where f ■— f ■ e{—(f)). Averaging in h using Minkowski's inequality we see that 

II/- c|Ui(G) = 0(1) 

for some constant c. As |/| = 1, we may take |c| = 1. We then have 

||/-c-e(0)|Ui(G)=o(l) 
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and the claim follows. This completes the proof of Theorem ll.il 

3. NEAR-EXTREMISERS ON INTERVALS 

We now prove Theorem ll.2l The arguments will closely follow the proof of Theorem 
11.11 in the previous section, but localised to intervals such as [N]. Again, we only 
treat the e > case. 

For fc = 1, the proof proceeds precisely as in the previous section, so we assume 
inductively that k > 2 and the claim has already been proven for k — 1. 

As in the previous section, we will just prove the near-extremiser claim, and relax 
the hypotheses to ||/||L~([Ar]) < 1 and ||/||;7''([7V]) ^ 1 ^ o(l). We may assume that 
e is positive but small, as the claim is trivial for large s. 

It wiU be convenient to introduce the expectation notation Ex£Af{x) := ^ J2xeA /(^) 
for any finite non-empty set A. We also say that a set _B is a very dense subset of 
a finite non-empty set A if one has \B\ > (1 — o(l))|A|. 

We embed [N] in Z/NZ for (say) N := 2*^^^ + 1, and extend / by zero to a function 
from Z/NZ — > C, which by abuse of notation we will still call /. From ([8|) we have 

ll/llc/''(Z/JVZ) ^ (1 ^ o(l))l|l[Ar]llt/fc(Z/JVZ) 

and hence by Q 

^hez/Nz\\'^''ff\\u''-^z/ffz) ^ (1 ^ o(l))II^hez/wzP/i+[Af]l[^v]llt/'=-i(z/Jvz)- 

The expectation on the right-hand side can be computed to be comparable to one 
(with bounds depending on k), and thus 

^heZ/Nz\\'^h+[N]'^[N]\\ui'-i{Z/ffZ) ~ ll^''//ll[/'=-i(Z/7VZ) = 

II/IIl°°(z/jvz) — expression inside the expectation is non-negative. Thus, 

by Markov's inequality, one has 

l|l/i+[JV]l[Ar]||(7fc-i(Z/ArZ) ^ ll^''//ll[/fc-i(z/Arz) = ^(1) 

for all /i in a very dense subset of Z/NZ. In particular, for all h m a, very dense 
subset of [Af/2], one has 

11^ '''ffllv-^Z/NZ) — W^lN-^Wuk-^Z/NZ) ^Ci(l). 

The norm on the right-hand side can be calculated to be comparable to one (with 
bounds depending on k). If we thus let //i : [iV — /i] — > C be the restriction of 
r^''// to [N -h], we conclude from ^ that 

\\fh\\u>'-H[N~h]) = 1 - o(l). 

Applying the induction hypothesis, we conclude that for all ft, in a very dense subset 
H of [iV/2], there exists a polynomial P/j : Z R/Z of degree < fc — 2 such that 

(25) \\.h-eiPh)\\LiaN-h])=o{l). 
In particular, by the triangle inequality 

\\\h\-i\\miN-k]) = o{i). 
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Using the crude bound \fh\ < |/|, \T '^f \ < 1, wc thus have 

E,e[jv-/.]1- 1/(^)1 = 0(1) 

and 

^.e[N-h]+hl-\f{x)\=o{l) 
for b1\ h Cz H, and so by the triangle inequahty 

IE.e[w]l - 1/(2^)1 

From this and Markov's inequahty we conclude that |/(a;)| = 1 — o(l) for all x in 
a very dense subset of [N]. We may thus modify / by o(l) in 7(fc+i) ([^v]) 

norm 

and assume without loss of generality that |/(a;)| = 1 for all x e [N]. 
We return now to (|25|). which we can now rewrite as 
(26) \\T-''f-e{Ph)f\\LHlN-h])=o{l) 
for all h e H. 

Let h e [— Then, as H contains a very dense subset of [-/V/4], we can 
find a,b e H n [N/A] such that b = a - h. From ^E^i one has 

|iT-V-e(Pa)/|Ui([3JV/4])=0(l) 

and 

WT-'^+^f - e{Pa-h)f\\LHl3N/i]) ^ o{l) 

and thus by the triangle inequality 

\\T--+\f - e{Pa-h ~ Pa)T-VllLi([3JV/4]) = o{l) 

and thus 

IIT'7 - e{T^{Pa-H - Pa))f\\L^i[N/2]) = o(l). 

Thus, for each h e [— -/V/8], we can choose a polynomial Qh of degree < k — 2 
such that 

l|T'7-e(Q,0/||Li([iv/2]) = o(l). 
Now let h, h' E [-N/16, N/16]. Then we have 

llrV-e(Q„)/IUM[w/4]) =0(1) 
||r''+'''/-e(T'^Q,0/IUHW4])=o(l) 
||T''+"7-e(Q,+,0/IUi([iv/4]) =0(1) 
and thus by the triangle inequality and the fact that |/| = 1 on [A^/4], we have 

\\e{Qh+h' - T'^Qh' - Qh) - M\l^[n/4]} = o(l) 
for aU h, h' e [-iV/16, iV/16]. 
We now need a variant of Lemma 12.11 

Lemma 3.1 (Separation lemma). Let fc > 1 and N > 1 be integers, and let 
P : [N] — > R/Z be a polynomial P{n) = X]i=o degree < k and coefficients 

Co,...,Cfc such that \\e{P) — l||L2([Ar]) = o(l). If N is sufficiently large depending 
on k, then for each < i < k, one has Ci — o{N^'') mod 1. 
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Proof. For k = 1 the claim follows by direct calculation of geometric series (or by 
Fourier analysis), so suppose that k > 2 and that the claim has already been proven 
for fc — 1 . Arguing exactly as in the proof of Lemma 12.11 we see that 

l|e(A„P)-l||i2([^/2j)=o(l) 

for all h € [— iV/2, A^/2]. By induction hypothesis, this implies that for each 1 < 
i < k — 1, the i*'' Fourier coefficient of A/jP is o{N~^) mod 1. When i = fc — 1, 
this coefficient is khck, thus khck = o{N-''+^) mod 1 for all h e [~A^/2, N/2]. We 
conclude that Ck — Qk + o(N~'^) mod 1, where qk is a multiple of 1/fc. If we then 
turn to i — k ~ 2, and restrict ft, to be a multiple of fc to eliminate the contribution 
of qk, we see that the (fc - 2)*^ coefficient of A^.P is (fc - l)hck-i + o{N-''+'^) 
mod 1. Repeating the previous argument, we see that Ck-i — qk-i + o{N~'^) 
mod 1 where qk-i is a multiple of l/fc(fc — 1). Iterating this, we eventually obtain 
Ci = qi + o{N~^) mod 1 for all 1 < i < fc and some qi that is a multiple of 1/fc!. 
The o{N~'^) component of Ci only influences e(P) by o(l) on [N], so without loss 
of generality we may assume that Ci = qi, t hus P is now periodic with period fc!, 
and can be viewed as a function on Z/fc!Z. If N is large enough (e.g. N > kl will 
suffice), this implies that 

MP)-i\\mz/k<z)=oii). 

The claim now follows from Lemma |2. II □ 

Applying this lemma, we thus see that for all h,h' £ [— A''/16, A^/16], the degree 
< fc — 2 polynomial 

(27) ChM' ■— Qh+h' — T'^Qh' — Qh 

is nearly constant in the sense that its i*^ coefficient is o(A^~*) for each < i < k—2. 
From ((27)) we see that the Ch^h' obey the 2-cocycle equation 

Ch,h' + Ch+h'Ji" = Chji' + h" + T^^Ch'M"', 

this is similar to the situation in the previous situation, but the Ch,h' are now 
polynomials instead of constants (which, among other things, introduces the shift 
in the above equation). However, one can remove the polynomial coefficients as 
follows: 

Lemma 3.2. For any —1 < j < k — 2, there exists < 6j < 1/16 depending only 
on j and k, and a polynomial Qh of degree < fc — 2 for each h e [—SjN, SjN], such 
that 

(28) l|TV-e(Q0/|Ui([w/2]) = o(l) 
for all h G [—5jN,5jN], and such that 

Ch.h' ■— Qh+h' — T^Qh' — Qh 

is a degree < j polynomial with an i*'* coefficient of o{N^^) for each 1 <i < j and 
all h e [-5jN/2,6jN/2]. 

Proof. We perform downward induction on j. The case j = k — 2 follows from the 
previous discussion. Now suppose that < j < fc — 2 and that the claim has already 
been proven for j + 1. Let Qh, Ch,h',Sj+i be the objects generated by the induction 
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hypothesis. If we let cji^'lP be the (j + 1)*'' coefRcient of Ch,h', identified with an 

element of the fundamental domain (—1/2, 1/2], then we have c'j^^P = o{N-^-^), 
and we have the 2-cocycle equation 

-0+1) , -0+1) _ , 

forall/i,/i',/i" G [~5j+iN/2,5j+iN/2]. We average this over all /i" G [-5j+iN/2,5j+iN/2]. 
If /i' = 1, we have 

and so we have 

Cm + + 1) = a{h) + a(l) + o(7V-J'-2) 
for h e [-(5j+i7V/2,5j+iAr/2], where 

a{K) ^h"(i[-S^^^N/2,5, + iN/2]C^hX^) ■ 

From the bounds on c''i^^},\ we have a{h) — o{N^^^^). If we then set Q'f^in) := 
Qh{n) + ■j^O'{h)n^~^^ to Qhin), then also obeys the required property (^5)) 
(with a slightly different value of o(l)), and the {j + 1)*'' coefficient (c'^ /i')^"'^^'' of 
the degree < j + 1 polynomial 

Q'h+h' ~ T^Q'h' ~ Q/i 

is given by 

(4,/.')^'+'^ = (£h,/.')^'+'' + a{h + /i') - a{h) - a{h') 

and thus 

(c')i';r) = o(7v-^-2) 

for G [— (5j+iA^/2, (5j-|_iA^/2]. Of course, (c')jf^,^'' also obeys the cocycle equation 
and vanishes when h or h' equals zero. 

Now define h{h) for h G [— ^j+i Af/4, (5j+iiV/4] by requiring that 

6(0) = 

and 

for aU h G [-(5j+i7V/4, 5j+iiV/4], thus h{h) = o(7V-J-i) for all h G [-5j+iiV/4, (5j+iAr/4]. 
From the cocycle equation and induction on h' one verifies that 

{cln')^=^''^^b{h + h')-b{h)-b{h') 
for aU h, h' G [-Sj+iN/8, Sj+iN/8]. If wc thus set 

J + 1 

for all h G [— (5j+iA^/8, J^+i A^/S], then Q'^ also obeys the required property (p8)) 
(with a slightly different value of o(l)), and the (j + 1)*'* coefficient of the degree 
< J + 1 polynomial Q',[^,^, - T''Q',[, - g'^ vanishes for /i, /i' G [Sj+iN/S, 6j+iN/8], 
so that this polynomial in fact has degree < j. The claim follows (setting Sj := 
<5,+i/8). □ 
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We apply this lemma with j = — 1 to obtain polynomials Qh of degree < fc — 2 for 
each h E [—d-iN,6-iN] obeying ([SS)) and such that 

Qh+h' — T^Qh' — Qh — 

for all h,h' e [-d-iN/2,6-iN/2]. In particular, setting F{h) := Qh{0) for h e 
[-5_iiV/2,(5_i7V/2], we have 

Qh{x) = A^Fix) 

for all h,x G [—6^iN/A,6-iN/4]. As each Qh is a polynomial of degree < k — 2, 
this implies that 

A,,, ...A,,,F(x) =0 

whenever h,x E [— (5_iiV/4fc, (5_iA^/4/c]. Setting hi = . . . = hk = I and inducting 
on k, we conclude that there exists a polynomial F : Z R/Z of degree < k — 1, 
thus 

fe-i 

1=0 

for some fo,..., fk-i G R/Z, such that F{x) = F{x) for aU x e [S-iN/Ak, S^iN/Ak]. 
We conclude that Qh and AhF agree on [-5_i7V/8fc, (5_i7V/8fc] for ah h e [-5^iN/8k, S-iN/8k]. 
By polynomial interpolation, this implies that Qh — AhF for all h e [~6-iN/8k, 5_iiV/8fc]. 
From (|28| we thus have 

||r'7-e(A„/')/|Ui([^/2]) =0(1) 
for aU /i e [-~S-iN/8k, S-iN/8k]; if we then set / / • e(-/'), then we have 

l|r'7-/IUM[w/2]) = o(i) 

for aU h e [-(5_iA^/8fc,J_i7V/8fc]. 

Note that if we multiply / by a polynomial phase such as e{—F), then both the 
hypothesis and conclusion of Theorem 11.21 remain unchanged (and / remains of 
magnitude 1). Thus we may assume without loss of generality that / = /, thus 

l|rV-/llLi([iV/2])-o(l) 

and so (recalling the definition of /;, = T''//), 

Wfh - l||Li([Af/2]) = o(l). 

Comparing this with (pSj) . we see that for all h in a. very dense subset of [5-iN/8k], 
one has 

MPh) - 1||li([jv/2]) = o(l); 
applying Lemma |3. 11 we conclude that 

MPh) - M\L^{[N-h]) = o(l) 

and hence that 

Wfh - iWmiN-h]) = o(i) 

or equivalently 

\\T''f-f\\LHlN-h])=o{l) 
for a all ft, in a very dense subset H of [—S-iN/8k]. 
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Observe that if h, h' G H , then by the triangle inequahty we have 

r'^+'v-/iiLi([iv-/.-h']) 

Iterating this a bounded number of times, we conclude that 

\\T^f-f\W([N^H])^0{l) 

for all hva. a. very dense subset of [N]. Thus 

Kez/NzT^f7i^) - TH^n\\n\{x) = o(l) 

for all /i in a very dense subset of By reflection symmetry, we may extend 
this to all /i in a very dense subset of [— iV, iV] = {— iV, . . . , A^}, and then (by the 
support properties of /) to a very dense subset of Z/iVZ. Averaging over /i, we 
then see that 

K,h^z/NzT''f7ix) - T%N]liN]ix) = o(l) 

which simplifies to 

Kez/Nzfi^)\' - Kez/NzMN]{x)\^ = o{l) 

and hence 

ll/lli/i([JV]) = 1 - o(l). 
The claim now follows from the k = 1 case of the theorem. 

4. L°° NEAR-EXTREMISERS ON SYSTEMS 

We now prove Theorem 11.31 Unsurprisingly, the arguments will closely follow the 
proofs of Theorem 11.11 and Theorem 11.21 

For fc = 1, the proof proceeds precisely as in the previous sections, so we assume 
inductively that k > 2 and the claim has already been proven for k — 1. 

As in previous sections, we will just prove the near-extremiser claim, and relax the 
hypotheses to ||/||loc(x) ^ 1 and ||/||;7fc(x) > 1 — o(l). We may assume that s is 
positive but small, as the claim is trivial for large e. 

We call a subset A of the natural numbers very dense if the lower density liminf/f_>.oo 
[i/]|isl-o(l). 

From ([T^ we have 

ll/IU^.(x) = l-o(l), 
so by arguing exactly as in Section [21 we may assume without loss of generality 
that 1/1 = 1 everywhere. 

From ([TT|) one has 

hm E,e[H]||(rV)7irJ':-i(^) = 1 - o(l). 

Since 

ll(TV)7ll£/-.m<ll(TV)7lU~m = i, 

we conclude from Markov's inequality that 

ll(rV)7llc/-iw = 1-0(1) 
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for all h in a very dense subset of N. Applying the induction hypothesis, we see 
that there exists a very dense subset A of N such that for all h € A, there exists a 
polynomial Ph : X R/Z of degree < fc — 2 such that 

||(rV)7-e(p,)|Ui(x)-o(i). 

As in Section [21 the next step is to pass from A to all of Z. Let h E Z. As A 
and h + A both have lower density 1 — o(l), we thus must have a representation 
h — a — b for some a,b € A. By hypothesis, one has 

||(TV)7-e(P,)|U.(^),||(TV)7-e(n)|U.(^)=o(l). 
Since |/| = 1, we conclude that 

IIT--"/ - e(r-^Pa)r-Vl|Li(x), \\T-'f - e{-T-'Pb)f\\L^ix) = o{l) 
and thus by the triangle inequality 

\\T^-\f - eiT-'Pa - T-''Pt)f\\mx) - 

The expression T^^Pa — T^^Pb is a polynomial of degree < fc — 2. We have thus 
shown that for every h G Z, there exists a polynomial Qh ■ X — > R/Z of degree 
< fc — 2 such that 

\\T''f-e{Qh)f\\LHx)=o{l). 

Fix a Qh for each /i e Z. 

If ft,, h' e Z, then by arguing as in Section [2] one has 

(29) \HQh+h' - T'^Qh' - Qh) ~ l\\mx) - o(l). 

By the analogue of Lemma [2TT] for systems (see O Lemma C.l]), we conclude that 
Qh+h' — T^Qh' — Qh is constant for every ft, h' G Z, thus 

(30) Qh+h' - T'^Qh' - Qh = ChM' mod 1 
for some Ch.h' '= From ([29]) we can assume that Ch^h' — o(l). 
From ([50)1 we obtain the 2-cocycle equation 

Ch,h' + Ch+h',h" = Chji'+h" + Ch',h" mod 1 

for all ft, ft', ft" G Z; since the c;i,h' = o{l), we can remove the mod 1 projection 
here and conclude that 

Ch,h' + Ch+h',h" — Ch,h'+h" + Ch' ,h" 

for aU ft, ft', ft" e Z. 

We perform the averaging trick. As Z is amenable, there is a translation-invariant 
mean A : £°°(Z) R on the bounded real sequences on Z. Applying this mean to 
average in ft", we obtain the relation 

(31) Ch,h' =b{h) + bih')~bih + h') 

for each ft, ft' G Z, where 5(ft) = o(l) depends only on ft. Now set Qh ■— Qh + b{h), 
then Qh is still a polynomial of degree < fc — 2, and wc still have 

(32) \\T''f-e{Qh)f\\LHx)^o{l) 
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for all /i g Z. From (j30| and ([31]) we have the cocycle equation 

(33) Qh+h' = T'^Qh' + Qh 
for ah h, h' e Z. 

We rewrite (|32|) as 

||/-e(-Q;,)rVllLim=o(l). 
By the triangle inequality, we thus have 

(34) 11/ - E,,e[H]ei-Qh)T''f\\LHx) = o{l). 

We now claim that the expression E^g[/f]e(— (3/i)T''/ converges in L^{X) norm to a 
limit F. This can be seen from the mean ergodic theorem. Indeed, if one considers 
the cocycle extension X X x R/Z with shift T : X X defined by 

f{x,e) {Tx,0 + Qi{x)) 

then from ([33]) and induction one has 

f'ix^e) := {T''x,0 + Qhix)) 

for every h E Z. If one then lets / G L°°{X) be the function 

f{x,e) ■.^f{x)e{~e) 

then we have 

(35) Eh^[H]f''fix, 0) = Ehe[H]ei-Qhix))T''f{x)e{^0). 

By the mean ergodic theorem, the left-hand side converges in {X) to a T-invariant 
function F, which must then take the form F{x)e{—0) by an inspection of the right- 
hand side of ([55]) . and the claim follows. 

From ([34)) one has 

(36) \\f-F\\mx)=o{l). 
From the T-invariance of F, one has 

(37) T'^F = e{Qh)F 

for all ft, G Z. In particular, |F| is T-invariant and thus constant almost everywhere 
by ergodicity. One can then write F = c-e{P) almost everywhere for some constant 
c and some measurable P : X ^ R/Z. From (|36p one has |c| = 1 — o(l), and in 
particular c is non-zero. From ([37[) we then have T^P~P = Qh for all h G Z, and in 
particular P is polynomial of degree < A; — 1. We now have \\,f — e{P)\\]^i(x) — o(l), 
and the claim follows. 

5. Near-extremisers of the Holder and Young inequalities 
Let {X,X,fj,) be a measure space. The Holder inequality asserts that 

ll/l ■ ■ • fm\\LP{X) < ||/i||lpi(X) • • • ||/m||LP™(Jf) 

whenever < pi, ... ,pm,P < oo are such that ^ + . . . + j- — ^ and fi £ L^* (X) 
for i = 1, . . . , m. If the pi, . . ■ ,Pm,P are finite and the fi are not almost everywhere 
zero, it is well known that equality occurs if and only if the are all constant 

multiples of each other up to almost everywhere equivalence. 
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We now establish a stable version of the above assertion: 

Lemma 5.1 (Near-extremisers of Holder). Let m > 2, let {X,X,ii) be a measure 
space, and let < pi, . . . ,Pm,P < oo be such that ^ + . . . + ^ — i, and for each 
i — 1, . . . ,m, let fi ^ LP^{X) be such that \\fi\\LPi{x) > Oj o,n-d- such that 

ll/l • • • fm\\LP{X) > (1 - e)ll/l||LJ'i(X) ■ • • ||/m||Lp™(X) 

for some £ > 0. Then there exists a measurable subset E of X with 
for all i = 1, . . . ,m, and positive real numbers ci, . . . , c,,,. > such that 

c,\f,{x)r = (1 + o™^p„...,p„(i))i/i . . . /,„(x)|p. 

for all X G X\E and i = 1, . . . , m. In particular, one has 

cm^)r = a+om,p,,...,p„mcj\f,ix)r 

for all X G X\E and i, j — \, . . . ,m. 

To prove this lemma, we first need a simple measure-theoretic lemma: 

Lemma 5.2. Let [X,X,^) be a measure space, and let f,g : X ^ R+ be non- 
negative absolutely integrable functions. Suppose that 

g dfi<e f d^i 
X Jx 

for some £ > 0. Then there exists a set E with 

/ f dfi<^/e f dfj. 

JE Jx 

such that 

g{x) < V~ef{x) 

for all X e X\E. 

Proof Set E::^{xeX : g{x) > y/efix)}. Then 

/ f d^i< I g d^<^/e f dji 
J E Jx Jx 

and the claim follows. □ 

Proof of Lemma \5.1[ We abbreviate Om.pi,....p„, (1) as o(l). By induction on m, it 
suffices to verify the case to = 2. By replacing fi with \fi\P for i = 1,2, we may 
assume that p = 1 (thus 1/pi + l/p2 = 1) and that /i,/2 are non-negative. By 
homogeneity we may rescale so that ||/i||lpi(x) — ||/2||lp2(js:) = 1. We then have 



fif2{x) dfi{x) > 1 - £. 
; elementary inequality 

/i/2(x) d^,ix) < [ (-fr{x) + -fr{x) \ dfi{x) = 1. 

Jx \Pl P2 J 



IX 

On the other hand, we have the elementary inequality 



X 
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, -fn^) + -/r(x) - /1/2W ) d^iix) = 0(1). 



We thus have 



As the integrand is non-negative, we can thus apply Lemma 15.21 to obtain a mea- 
surable set E such that 

'-/rw + -/r(^)) df,{x)^o{i) 

E \Pl P2 J 

and 

(38) -fn^) + -/r(^) - hf2{x) = o f-/r(x) + -f^x) 

Pl P2 \Pl P2 

for all X ^ E. Writing ff^{x) = exp(a) and /f^(a;) — exp(6), one can rewrite 
as 

exp ( — a H b ) = (1 — o(l)) ( — exp(a) H exp(6) ) . 

\Pi P2 J \Pi P2 J 

Using the convexity of the exponential function (and normalising to, say, max(a, b) = 

if desired) we conclude that a — b + o{l), or in other words that ff^{x) — 

(1 + 0(1))/!'" (x), and the claim follows. □ 

We now use the above lemma to analyse near-extremisers to Young's inequality. We 
begin by recalling this inequality, together with its proof via Holder's inequality: 

Proposition 5.3 (Young's inequality). Let G — {G,+,B,fj,) be a locally compact 
abelian group, and let < r < p, q < s < 00 be such that h + ^ h + h ■ Then for 

r s p q 

any f S L^iG) and g € L'^{G), one has 

(39) (^lj{T''f)9\\UG)Mh)y <\\f\\LHG)\\g\\L.iG)- 

In practice, we will only apply this inequality with the exponents 

(40) P = q=Pk = j^^; r=pk-i = —^] 5 = 2'^-! 

for some fc > 2; one easily verifies that the hypotheses of this proposition are 
satisfied. 

Proof. Let h E G. From the factorisation 

\T'f\\9\ = \T'^f\'^\g\'^i\T'^fix)ngix)n'/', 
Holder's inequality and the identity 

1 s ~ n s — q 1 

- = + + - 

r sp sq s 

one obtains 
(41) 

ll|rVll5llU.-(G)< ll|rVl'^llw<=-.)(G)lll5l'^llw(=-,)(G)ll(|7^V(^)ri5(^ 
which we rearrange as 



||(T''/).9ll^(G) < \\f\\T.fG)\\9\\T4G) / \T'^fi^)n9{x)\'' dii{x) 
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for any h ^ G. On the other hand, from Fubini's theorem we have 
(42) j^jjT'^f{xr\g{xr dfi{x)df,{h) = II.9IIL(g)' 

and m\i follows. 



□ 



If iJ is a compact open subgroup of G, and xq + H, x'q + H are two cosets of H, 
then a brief calculation shows that equality will hold in (|39p if / is a scalar multiple 
of Ixo+H, and g is a scalar multiple of Ix'^+H- It was observed in [lOj that this is 
essentially the only such example. More precisely, we have: 

Proposition 5.4 (Inverse Young inequality). [101 Let the notation and hypotheses 
be as in Provosition 1 5. 3\ Assume the normalisation 



(43) 

and assume that 
(44) 



II/IIlp(G) - \\9\\li{G) 



l/s 



> 1 



for some e > 0. Then there exists a compact open subgroup H of G and cosets 
xq + H , x'f^ + H such that 



LP{G} 



= 0(1); \g\~t,iH)-'/n,, 



+H 



Li{G) 



^p^q^r.s (f ) ■ 



This result does not explicitly appear in [10 , but follows from the methods in that 
paper. For the convenience of the reader we now give the proof of this proposition 
here. We first need an inverse sumset estimate: 

Lemma 5.5 (Inverse sumset estimate). Let G be a compact abelian group with 
Haar measure n, and let K C G be a compact set of positive measure such that 

where K — K := {k — k' : k,k' S K} is the difference set of K with itself. Then 
K — K is a compact open subgroup of G. 



Proof. Let a and b both be elements of K — K. Writing a as a = fc — fc', we see 
that K — k, K — k' both lie \a K — K, which has measure less than |/i(i4r), and 
so K — k and K — k' must intersect in a set of measure greater than ^^{K)] we 
conclude that ^l{K r\ {K + a)) > ^fi{K). Similarly ^j.{K n {K + b)) > ^fJ.{K), and 
hence ti{{K + a) (1 {K + a + b)) > ^^{K). By the triangle inequality, this forces 
li{K n {K + a + b)) > 0, and in particular K n {K + a + b) is non-empty. In other 
words, a + b £ K ~ K. We thus see that K — K is closed under addition; as it is 
also symmetric, it is a group. As K — K is compact and has positive measure, it 
must also be open, and the claim follows. □ 



Now we can prove Proposition 15.41 
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Proof of Proposition \5.4\ We abbreviate Op,g,r,s(l) as o(l). We may assume that e 
is small, as the claim is trivial for large e. Observe that we may relax the right-hand 
side of pi)) from 1— etol — o(l). We may similarly relax the normalisations P5)) 
to 

\\f\\LP{G), \\g\\Li{G) = 1 + 0(1). 

The point of doing this relaxation is that the hypotheses (and conclusion) of the 
proposition are now stable under perturbations of / by o(l) in the norm and 
perturbations of g by o(l) in the norm (the stability of ([^^ following from 
Proposition 15. 3p . We will exploit this freedom to perturb f,g by o(l) in these 
norms later in this argument. 

By replacing /, g with |/|, \g\ respectively, we may assume without loss of generality 
that /, g are non-negative. Define the non-negative functions A, B : G R"*" by 
the formulae 

A{h) := |l(rV).g|lI.(G) 

and 

Bih) / iT'^ f{x)rgix)'' df,{x). 
Jg 

Clearly A,B are measurable (in fact, they are continuous). From the proof of 
Proposition 15.31 and the hypotheses, we have 

0<A{h) < {l + o{l))B{h) 

for all h, and 

(45) l-o(l)< [ A{h) dn{h) < {1 + o{l)) [ B{h) dfi{h) < 1 + o{l). 

Jg Jg 

Thus there exists h such that 

< (1 - o{l))B(h) < A{h) < (1 + o{l))B{h). 

By translating / by /i if necessary (which does not materially affect either the 
hypotheses or conclusions of Proposition 15.41) . we may assume without loss of gen- 
erality that h — 0. Thus we have 

(46) Wfgh^iG) > (1-o(1))||/'^||w(=-p)(g)II5'^IIw(=-.)(g)II(/V)'/1Il=(g)- 
Applying Lemma 15.11 we can thus find a measurable set E with 

/ fP dfi, [ g'^diJi^ 0(1) 

J E J E 

and constants ci, C2, C3 > such that 

f^{x)^{l + o{l))c^f^{x)g^{x) 

g''{x) = {l + o{l))c2f^{x)g^{x) 

fP{x)g^x) = {l + o{l))csr{x)g^{x) 

for all X E G\E. By modifying f,g by o(l) in the U'{G),L'^{G) norms respectively 
we may assume that /, g vanish on so that the above estimates now hold for all 
X. In particular, / and g are supported on the same set H. Solving for /, 17, we 
conclude that there exist constants Ci , C2 > such that 

f{x) = (1 + o(1))Ci1h(x); g{x) = (1 + o{1))G21h{x) 
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for all X Cz G. Using the normalisation of f,g, and modifying f,g by o(l) in the 
LP{G),L'^{G) norms respectively, we may assume that H has positive measure and 

Inserting this back into (|45|) . we have after some algebra that 

/ ^iiH niH + h))'/"- d^i{h) > (1 - o{l))^l{Hy/''+\ 
Jg 

On the other hand, from Fubini's theorem we have 

(47) / fi{Hn{H + h)) dfi{h) = {1-0(1)) fiiHf 

Jg 

and thus 

/ fiiH n{H + h)){ti{H)'''-^ ~ fi{H n{H + h))'/"--^) dfi{h) ^ o{ii{Hy''-+^). 
Jg 

Applying Lemma 15.21 we can find a measurable set K such that 

/ fi{H n {H + h)) dfi{h) = o{fi{H)^) 
Jg\k 

and such that 

ii{H n{H + h)){fi{Hy^'-^ - fi{H niH + h)y/-'-^) = oininy^'-^fiiH n{H + h))) 

for all h G K. Discarding those h for which /i(i/ n {H + h)) — 0, we may thus 
assume that 

M(Hn(i/ + /i)) = (i-o(i))/i(i/) 

for all h E K. By continuity of the function h i-^ fi{H {H + /i)), we may take K 
to be compact. Integrating the above bound in h and using (j47|, we have 

^i{K) = {l + o{l))^l{H). 

Also, if h, h' e K, then 

fi{H n{H + h)),fi{{H + h)n{H + h- h')) = (1 - o(1))m(H) 

and hence by the triangle inequality 

^l{H n{H + h- h')) = (1 - o{l))n{H). 

Thus, for every k in the difference set K — K (which is also compact), one has 

fi{Hn{H + k)) = {l-o{l))t4H). 

Integrating this in k and using (j47|) , we conclude that 

fiiK -K)^{1 + o(1))m(H) = (1 + o{l))^i(K). 

By Lemma 15.51 we conclude that Hq :~ K — K is a compact open subgroup of 
G. By the above discussion, we have fj,{H n {H + h)) = (1 — o{l))^{H) for all 
h E Hq. Averaging in h, we conclude that 1h and j[(Jj^^Ho * Iff differ by at 
most o{ii{H)) in norm. As the latter function is constant along cosets of Hq, 
and the former function takes values and 1 and has a total norm of IJ,{H) — 
(1 + o{l)) ii{Hq) , both functions must differ from the indicator function of a single 
coset by o{h{Hq)) in norm. In other words, there is a coset x + Hq of Hq such 
that ijl{H n (x + Hq)) = (1 - o(l))/x(i?o), and thus ijl{H\{x + Hq)) = o(l). Thus, 
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after modifying f,g by o(l) in and L'' norm respectively, we may assume that 
H = X + Ho, and the claim follows. □ 



6. LP NEAR-EXTREMISERS ON GROUPS 

We can now quickly prove Theorem 11.41 Again, we only establish the claim for 
e > 0. Fix k > 2 and G. It suffices to show that if / G LP''{G) is such that 
ll/llLPfc(G) ^ 1 + o(l) and ||/||;7fc(G) > 1 ^ 0(1)7 then there exists a coset H = 
xq + Ho and a polynomial P : Ho — > R/Z of degree < fc — 1 such that ||/ — 
M(i/)-i/pn^^e(P(. - xoML^^iG) = 0(1). 

From (HI), ©one has 

/ iKrWiii'.-HG) d^i{h)>i-o{i). 

Applying Proposition [5J] we conclude that |1|/| - la;(,+_f/||^2'=/(fc+i)(G) = 

0(1) for some coset xo + H oi a subgroup H of G. By translating, we may set 
xo — 0; by modifying / by o(l) in ^^^^^\G) norm, we may assume that 

\f\^^l{H)-'^pnH. 

By dividing the Haar measure /i by (and multiplying / by fi{HY^P^) we may 

normalise so that /i(i?) = 1, thus we now have \ f\ ^ 1h- At this point we may 
restrict / to H, and the claim follows from Theorem ll.il 

7. ErGODIC theory ANALOGUES 



We can now adapt Proposition 15.31 and Proposition 15 .41 to ergodic systems: 

Proposition 7.1 (Young's inequality for ergodic systems). Let X = {X, X , fj.,T) 
be an ergodic system, and let < r < p, q < s < 00 be such that ^ + — | + |- 
Then for any f G LP(X) and g G L'^{X), one has 

(48) limsup(E,g[^]||(rV)5lli.-m)'/' < \\f\\Lv(x)\\9\\L^(xy 

_f/— J-oo 

Proof. Repeating the proof of Proposition 15. 3[ one has 

wmux) < \\f\\T4x)\\9\\r4x) [ irV(^)n<?(a;)r d^,ix) 

for any h On the other hand, from the ergodic theorem one has 

Jim^E,e[H] / \T'^f{x)\P\g{x)\'' d^i{x) ^ \\f\\l,^^^\\g\\l,^^y 
and the claim follows. □ 

Proposition 7.2 (Inverse Young inequality). Let the notation and hypotheses be 
as in Proposition ]?. 1\ Assume the normalisation 

(49) = Mlhx) = 1 
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and assume that 

(50) limsup fE,e[H] / ||(T'7)5|I1.-(x) V 

for some £ > 0. Then there exists a coset H of X and an integer h such that 

II I/I - ^l{H)-'^nH\\LHX) = 111.91 - KHr'^nT^Hh^iG) = Op,,,.,,(l). 

Proof. As before, we abbreviate Op^g^r.s(l) as o(l), assume e is small, and relax the 
right-hand sides of (jlH]), ((5(1)) to 1 + o(l) and 1 — o(l) respectively. Repeating the 
proof of Proposition 15.41 we may reduce to the case where 

for some measurable subset H oi X oi positive measure. To finish the proof, it 
suffices to show that H differs from a coset by a set of measure o{ijl{H)). 

Arguing as in the proof of Proposition l5.4[ we have 

limsupE;.e[^]A*(i/ n T'^i/)^/'- > (1 - o(f))/i(i/)^/'^+\ 

while from the ergodic theorem one has 

Also, fi{H n T^H)) is trivially bounded above by IJ.{H). We conclude that 
limsupE,,e[jv]M^nT''ff)2 > {1 - o{l)) ^l{H)\ 



which by Definition 11.21 implies that 

(51) \\lH\\uHx)>{l-o{l))niHf/\ 

To use this, we introduce the theory of the Kronecker factor. Let Zi be the sub- 
(j- algebra of X generated by the eigenfunctions of X (or equivalently, by the pure 
point spectrum of T). As is well known (see e.g. 23 ), the Kronecker factor 
{X,Zi,fi IznT) is equivalent to an ergodic rotation Zi = (Zi, Zi, /ii, Ti), i.e. a 
compact abelian group Zi = [Zi,+) with Haar measure fix and an ergodic shift 
Ti : z I— ^ z + a for some a e Zi. In [231 Lemma 4.3], it is shown that ||/||;72(x) = 
whenever the function / £ L°°{X) is such that the conditional expectation E(/|Zi) 
vanishes. In particular, 

\\1h -mH\Zi)\\u2ix)=0 
and thus by the triangle inequality 

l|lff||;72(x) = l|IE(l//|2'i)||[/2(_,f). 
Applying ([ST]). (fT3|) we conclude that 

||E(l^|Zi)|L4/3(;,)>(l-0(l))^(i7)3/4. 

Since 

I|e(1h|^i)IIlhx)-I|1h||lm^)=a*(^^) 

we conclude from Holder's inequality that 

l|E(lH|^i)|li.(x)>(l-o(l))Mi^)- 

Since 

UH\\hix)=KH) 
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we conclude from Pythagoras' theorem that 

\\1h -mH\Zi)\\h^x) = o{KH)). 

As a consequence there is a Zi-measurable subset H' that differs from H hy a. set 
of measure o(fjL{H)). Thus, without loss of generality, we may assume that H = H', 
i.e. that H is Zi -measurable. We may thus restrict X to the Kronecker factor Zi 
and assume without loss of generality that X = Zi. By [531 §3-2], the Gowers- 
Host-Kra seminorm C/^(Zi) on the Kronecker factor Zi coincides with the Gowers 
uniformity norm U'^{Zi) on the compact abelian group Zi. Applying Theorem ll.41 
we see that H differs from a coset by a set of measure o{fi{H)), as required. □ 

8. LP NEAR-EXTREMISERS ON SYSTEMS 

Now we establish Theorem 11.51 Again, we only establish the claim for e > 0. Fix 
k > 2andX. It suffices to show that if / S LP>'{X) is such that H/Hlp^x) < l+o(l) 
and ||./||;7fc(x) 1 ^ then there exists a coset H and a polynomial P : H ^ 
R/Z of degree < fc - 1 such that \\f - ii{H)-'^/P>'lHe{P{- - xo))\\LVk{x) = o(l). 

From (ini), dUl) one has 

Jim^E„e[if]|l(rV)7lli'"i(G) > 1 - o{l). 

Applying Proposition [72] we conclude that |||/| — ^(-ff)^^^^''' lH||^2fc/(fc+i)(x) = 

for some coset H of X. By modifying / by o(l) in /^^^^\X) norm, we may 

assume that 

If we then replace the system X with the normalisation of the coset H (as defined 
in Definition [HI), and set / : iJ ^ C be the function f{x) := ^liHf/P" f{x) for 
X € H, we see (as in the introduction) that 

ll/llLPfc(X) \\f\\LPk{H) 

and 

WfWw'iX) = WfWviH)- 

We may thus pass from X to H, and the claim now follows from Theorem ll.3l 

Remark 8.1. By combining Theorem ll.Sl with Theorem ll.4l we see that a measurable 
subset H of an ergodic system X is a coset of X (in the ergodic theory sense) if and 
only if _ff is a coset of the Kronecker factor Zi (in the group theory sense), modulo 
null sets. Indeed, the "if" part is easy. To see the "only if" part, observe that if 
_ff is a coset then 1h can be decomposed as a linear combination of eigenfunctions 
J2]jLi e(jfc/TO)lTJ_ff of T, and so H is measurable with respect to the Kronecker 
factor, which is a compact abelian group. By Remark II. 21 the Gowers and Gowers- 
Host-Kra norms of 1h then coincide, and the claim follows from Theorem 11.51 
and Theorem 11.41 Alternatively, one can observe from the density of the orbit 
{na : n e Z} of the ergodic shift a € Zi in Z\ that for every 6 G Zi, H + 9 is either 
equal to H or is disjo int from H modulo null sets, which then implies that H is a, 
coset of Zi in the group-theoretic sense up to null sets. 
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9. The Gowers norm on Euclidean spaces 

We now prove Theorem 11.61 Our main tool is the following refinement of Proposi- 
tion [531 in the Euclidean case, due to Beckner 4 and Brascamp and Lieb [7 : 



0<r<p, g<s<oo be such that f + ^ = | + |- Then for any f e LP(R") and 

l/s 

(52) 



Theorem 9.1 (Sharp Young's inequality). [4j, ^ Let n > 1 be an integer, and let 
< r < p,q < s < CO 
g e L''(R"), one has 

^JI(T^'7)5ll^(R.) dh^ ' <(Ap/.A/,M,/,,)"/''||/||l.(r.)||<?||l.(R") 

/ 1/p n1/2 

where Ap :— {j^yr/VJ '^'"^^ 1/p+l/p' = 1. If f,g are non-negative, then equality 
occurs if and only if one has f — coe~P*^^~^''^'*^'-^^^°^ and g — c^Q~<ii^-^i)-^'^i^-^i) 
for some Cq,ci G C, xo,xi G R", and positive definite M. 

Remark 9.1. The results of Beckner and Brascamp-Lieb directly handle the r = 1 
case of this theorem, but the general case then follows by the substitution f i-^ \f\^, 
9 '-^ IsT, s/r, p^ p/r, q/r. 

Specialising this inequality to the exponents PO)) and g = f for some fc > 2, we see 
that 

X l/2'=-i / 42 X nfc/2''-i 



'R" 

A calculation shows that 



^2fe/(fe+i) _ 21AA:1/2 



and thus 

„ / ^2fc/(fc+l) \ 1/2 

Because of this, pT)) follows from ©, (|5^ . and induction. 



From Theorem 19.11 we see from the above argument that equality holds when / 
takes a gaussian form 

fix) = ce'i^~^o)-M{x-x„) 

for some c G C, xq G R" and a positive-definite nx n matrix M. From pi)) we see 
that the same holds for functions / of the form p9)) . 



Remark 9.2. One can of course also verify equality in the Gaussian case by direct 
computation. To illustrate this we consider the model case when n = 1 and f{x) = 
g-7r|£c| ^ rpj^i^ right-hand side of (|17l) is easily seen to equal 

(2V(d+l))-(''+')/''^' 
while the left-hand side is equal to 

IdetMpi/^ 
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where M is the d + 1 x d + 1 matrix for the quadratic form 



Q{x,hi,...,hd) := 


E 




h LJihi + . . . 


A short computation shows that 










/ 2d 

2d-i 


2d-l 2d' 
2d-l 2d- 


'1 

-2 


... 2d-^\ 
. . . 2d-^ 


M = 


2d-i 


2d-2 2d- 


-1 


. . . 2d-^ 




^2d-i 


2d-2 2d- 


-2 


... 2d-^) 



Using Gaussian ehmination (subtracting half of the first row from the remaining 
rows) one sees that 

det M = 2'^2^d-2)d ^ 2d(d-i) 
and the claim then follows after another short computation. 

Now we consider the converse problem. Suppose that equality holds in ([T7|) . Since 



1/I1;7''(R" 



< 



!7'=(R" 



equality must also hold for |/|. By Theorem 19.11 and the above argument, we thus 
see that 

\f{x)\ = ce~(="~^°)-^^(=""^°) 

for some c > 0, xo G R" and a positive-definite nxn matrix M . We may of course 
assume that c is non-zero, in which case we can normalise c — 1. By translation 
symmetry we can normalise x = 0, and by a linear change of variables we may 
assume that M is the identity, thus 

for some P : R" R/Z. Since e^'^' is always non-zero, and since / and |/| must 
have the same t/'^(R") norm, we see from ^ that 

A;,,...A,,P(a;) = 

for almost every hi, . . . ,hk,x e R'', and so P is a polynomial of degree at most 
k — 1. The claim follows. 

Remark 9.3. The same method also gives sharp Young- type inequalities for the 
Gowers inner product 



((/-)..e{o,i}^>c/nR") - / ■■•/ n c-^+-+"'^7<.i,...,c.,(x+wi/ii+...+ 

we omit the details. 

Remark 9.4. In the case A: = 3, the above inequality becomes 

li/lir/3(R..) <2-"/«||/||i.(R„). 

It is worth noting the large number of invariances that this inequality enjoys. We 
have already observed the invariance with respect to linear changes of variable, 
translation, and multiplication by quadratic phases e((/)(a;)) = e{x ■ Mx + ^ ■ x + 0). 
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But a calculation also shows that the norm (just like the norm) is preserved 
by the Fourier transform operation f f, where 

m / dx. 

As a consequence, the C/'^(R") norm is in fact preserved by the entire metaplectic 
representatiorQ on R". Further Fourier-analytic symmetries of this type will be 
implicitly exploited in the proof of Theorem 11.81 in the next section. 

10. Threshold for on systems 

We now prove Theorem ll.81 We begin with the first claim; the claim that 2~^^^ is 
best possible will be deferred to the end of this section. 

For any ergodic system X, define the Abramov factor A2 to be the factor A2 = 
{X, A2, M [a2 j where A2 is the sub-cr-algebra of B generated by the polynomials 
P of degree at most 2. It will suffice to show that 

(53) \\f\\uHx)<2-'^'\\f\\LHx) 

whenever / is orthogonal to the Abramov factor (thus E(/|^2) = 0). By a limiting 
argument (using (fT2|) ) we may assume that / G L°"{X). 

We now use the machinery of Host and Kra [53] to reduce to the case when X is a 
2-step nilsystem: 

Definition 10.1 (2-step nilsystem). A 2-step nilsystem is an ergodic system of 
the form {G/T,B, iJ,,T), where G is a 2-step finite- dimensional nilpotent Lie group, 
T is a discrete cocompact subgroup of G, X is equipped with the Haar probability 
measure and the Borel a-algebra, and the shift T is given by Tx — tx for some 

TdG. 

Recall that in ^23^, a factor Z2 of X was constructed with the property that 

(54) ll/l|[/3m=0 

whenever / was orthogonal to Z2 (see [23l Lemma 4.3]), and such that Z2 was a 
system of order 2 in the notation of [HH^^; see [331 Proposition 4.11]. Indeed, Z2 
was the maximal factor of order 2. It is easy to see that the Abramov factor A2 
is a system of order 2, and so is necessarily a subfactor of Z2. From ([5^ and the 
triangle inequality we see that 

= ||E(/|Z2)||t/3(x) 

for all / e L^iX), and so without loss of generality (replacing / with E(/|Z2) if 
necessary) we may assume that / is Z2-measurable. In particular, we may assume 
without loss of generality that X is of order 2. Applying [231 Theorem 10.1], 
we conclude that X is an inverse limit of 2-step nilsystems. Applying a limiting 
argument (noting that if F is a factor of X, then the Abramov factor of F is a 

^The metaplectic group is a double cover of the symplectic group on H?"; its (unitary) action on 
R" is known as the metaplectic representation is generated by modulations f{x) 1— > f{x)e{P{x)) 
by homogeneous quadratic polynomials P, linear changes of variable f{x) h-^ \ det{A)\^^^ f(Ax), 
and the Fourier transform. See e.g. [31) for further discussion. 



36 



TANJA EISNER, AND TERENCE TAG 



factor of the Abramov factor of X), we may thus assume without loss of generality 
that X is a 2-step nilsystcm, X ~ G/T. (Note that these reductions do not destroy 
the total ergodicity of the system.) 

Next, we can place the ergodic nilsystem G/T in a standard form. As discussed in 
[26l §4.1, 4.3], we may assume without loss of generality that the nilsystem obeys 
the following properties: 

• G/r is minimal (i.e. every orbit is dense) and uniquely ergodic. 

• The commutator group 6*2 [G, G] is a torus. 

• The group T is abelian and has trivial intersection with the centre of G 
(and in particular with G2). 

• G is spanned by the connected component G° of the identity and by r. 

• The Kronecker factor Zi is isomorphic to the quotient system G/(G2r), 
with the obvious factor map. 

In particular, the torus G2 acts freely on G/T. Because of this, every function 
/ e L?{G/T) has a Fourier decomposition 

(55) / = E /? 

where for each ^ in the Pontryagin dual G2, G L'^{G/T) is the function 

(56) /^x) := / e(-e(.92))/(g2x)d/.G.(ff2) 

JG2 

(with fiQ^ being the Haar probability measure on the torus G2) and the series is 
unconditionally convergent in L^(G/r), with the being orthogonal and obeying 
the Plancherel identity 

ll/llL2(G/r) = E ll/«llL2(G/r)- 

Observe that each is an eigenfunction of G2 action, in that 

(57) fd92x)=e{a92))fd^) 
for all X e G/r and 32 € G2. 

The U^{G/T) norm can be rewritten explicitly as an integral, as follows. Given a 
2-step nilpotent group G, define the Host-Kra group HK'^(G) (also known as Gj'^' 
or HP'^(G)) to be the subgroup of G'^"'^^^ given by the octuples of the form 

( n 

\w'G{o,i}3:tj^<Wi for all 1=1,2,3 

where g^j' £ G'lw'l ioi all u' £ {0, 1}'^, with the convention that Gq — Gi := 
G and G3 is trivial, and the product is ordered in some arbitrary fashion (e.g. 
lexicographical ordering on {0, 1}^ will suffice). One can check that this is indeed 
a group; see jTS]. For a 2-step nilmanifold, one can show that HK'^(r) is a discrete 
cocompact subgroup of the 2-step nilpotent Lie group HK'^(G) (see e.g. [24l §B.2]), 
and thus the quotient space HK^(G)/ HK^(r) C (G/r){*'^i>' is a nilmanifold with 
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a Haar measure fJ^HK" (G) / hk^ {r) ■ The U^{G/T) norm can then be expressed by the 
formula 

(58) ||/ll'L.3(G/r) = / n C^^^fM '^A*HK3(G)/HK3(r)(^) 

7HK3(G)/HK3(r)^4o^^j3 

where x = ixuj)ui£{o,i}^, and C : z t-^ 1 is the complex conjugatfon map; see [24j 
§B.3-B.5] (and also ,23 ). 

The Fourier decomposition reacts well to both the Abramov factor and the 
seminorm: 

Lemma 10.2. Let f E L°°(G/r), and let be the Fourier components of f. 

• f is orthogonal to A2 if and only if each f^ is orthogonal to A2 . 

• One has ||/||^3(G/r) = EjeG^ II/? ll^3(G/r) • 

Proof. We prove the first claim. The "if" part is trivial from the expansion (I55p . 
which is unconditionally convergent in Lp'{G/V). To show the "only if" part, one 
observes that the action of G2 commutes with T and thus preserves the property 
of being a polynomial of degree < 2, and the claim then follows from (j56p . 



Now we establish the second claim. By the unconditional convergence of the Fourier 
expansion / — /c in L/^iG/T) and (HH), we may assume without loss of generality 
that all but finitely many of the /j vanish. By (|55|) . the left-hand side expands as 



E 



(C.)..^o,u3^(5.)<o.>3^HK3(G)/HK3(r)^,^„^,j 

while the right-hand side expands as 



/ TT Cl"l/^^(x„) d^HK3(G)/HK3(r)(a;) 

iHK3(G1/HK3fn _r„,i. 



E 



/ TT C''^'/j(x^) dMHK3(G)/HK3(r)(a 

JHK3(G)/HK3(r)^g^0 



It thus suffices to show that the expression 

(59) / TT <^'"'/?c.(2;i.^) '^/^HK3(G)/HK3(r)(a;) 

JHK3(G)/HK3(r)^g^0_,p 

vanishes whenever the are not all equal to each other. 

Suppose now that G {0, 1}^ are such that 7^ ^^j', then there exists 32 € G2 
such ^^(.92) 7^ ^^'(32)- Call the integrand in ((59| F. Then from (|57)) we have the 
eigenfunction equation 

^((ff;.^")c^"e{o,i}3a;) = e(C(.,(52) - £.i^'{g2))F{x) 

for all X e HK^(G)/HK3(r), where (7tj" is equal to g2 when w" — oj, when 
a;" — uj' , and the identity otherwise. On the other hand, the element (5Lj")cj"e{o,i}3 
is easily verified to lie in HK'^(G'), and thus leaves the Haar measure /iHK3(G) / HK3(r) 
invariant. As the factor 6(^1^(52) — ^^'(92)) is not equal to 1, the integral in ([5^ 
must vanish, and the claim follows. □ 
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In view of the above lemma, we see that in order to prove the inequahty (|53p for 
/, it suffices to do so for each /j. In particular, we may assume without loss of 
generality that / obeys ((57|) for some ^ G G2. 

Next, we reduce to the case when G is connected, as follows. Suppose that G is not 
connected. In this case, G/G°T is a discrete compact group (i.e. a finite group) 
generated by t. As t is totally ergodic, this group must be trivial. Thus we have 
T = (7o7 for some go G G° and 7 G F. We have 

Tx = gox['j, x] 

and thus 

1) 

(60) T^x ^ glx[j,xY[j,gQ]^^ . 
In particular 

(61) fir'x) = figix)emb,x]) + -lil^Cill, 9o])). 

The group elements t, go have the same projection to the abelianisation G/G2T. 
As r is totally ergodic, we conclude from a theorem of Leibman 28 that 50 is also 
totally ergodic. Thus we can build a nilsystem with nilmanifold G° / {G° DT) = G/T 
and shift given by go, which has the same Haar measure as the original nilsystem 
G/T. From (|6T|) and a short calculation we see that / has the same norm with 
respect to the original nilsystem G/T as it does with the new system G° /{G° H F); 
also, from ()60p . every polynomial of degree < 2 in the original nilsystem remains a 
polynomial of degree < 2 in the new system and vice versa. Finally, the property 
(|57| is retained (possibly after restricting ^ to a smaller commutator subgroup 
[G°,G°] if necessary) after passing from the old nilsystem to the new one. From 
this discussion we see that we may assume without loss of generality that G = G°, 
so that G is connected. 

If ^ is zero, then / is now G2-invariant, and thus Zi -measurable. But as Zi is 
contained in the Abramov factor A2 , and / is orthogonal to A2 , this implies that / 
is trivial, in which case (j53p is also trivial. So we may assume that ^ is nonzero. In 
this case, / is invariant with respect to the subtorus := {52 & G2 ■ ^(52) = 0} of 
G2; by quotienting out by this subtorus, we may assume that ^-^ is trivial, so that 
G2 is now isomorphic to the unit circle R/Z. In the notation of [26], this means 
that the 2-step nilmanifold G/T is elementary. 

We will no longer need the hypothesis that / is orthogonal to the Abramov factor, 
as this has been superseded by the non-zero nature of ^ and the connected nature 
of G. We therefore discard this hypothesis, as this will free us to perform some 
additional transformations on the nilsystem G/T. 

The elementary connected nilmanifolds were classified in }26[ Lemma 12]. According 
to that lemma, the nilsystem G/T is isomorphic to a product G' /T' x (R/Z)™, 
where m > 0, G' is the 2-step nilpotent group G' = R^'' x R/Z for some d > 1 
with multiplication law 

{x, z){x, z') — {x + x' , z + z' + {Ax, x')) 

where A is a 2(i x 2(i a matrix with integer entries such that B := A — A* is 
nonsingular, and F' :— Z^'^ x {0}. 
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U, and A = I 

Indeed, our strategy will essentially be to reduce to this case. 



Remark 10.1. A model case to keep in mind is when d = 1, m = 0, and A = ^ ^ ^ 



The next step is to eliminate the torus factor (R/Z)™. For this we use a convenient 
Fubini-type theorem for the Gowers-Host-Kra norm: 

Lemma 10.3 (Fubini-type theorem). Let fc > 1. If X,Y are two ergodic systems, 
and / e X y), then 

ll/ll;7'=(xxy) < ll^ll/ycr) 

where F G L°°{Y) is the function F{y) := \\fy\\u''(x)> OLnd fy G L°^{X) is the 
function fy{x) := f{x,y). 

Proof. From [23l §3] , one has 

(62) WfWtix.Y)^ I n C\-^f{x^,yu)d^i^^\y{x,y) 

where the cubic measure space {X x Y)^''\ ^^^^y) is defined in [2^, §3]. An inspec- 
tion of that construction reveals that 

[fc] _ [fc] [fc] 

and thus (by Fubini's theorem) we can rewrite the previous expression as 

/ / n c\-^fyMd^i^x\^)d^^Yiy)■ 

From the Cauchy-Schwarz-Gowers inequality (see j23l Lemma 3.9]), we have 

/ n Cl-l/,Ja:.)d/z|^l(2;)< n ^(?/-)' 
wG{o,i}* i^e{o,i}* 

and the claim follows by applying (|62l) for F. □ 



From this inequality and the Fubini-Tonelli theorem, we see that to establish ((53|) for 
/, it suffices to do so for the functions fy-.x^ f{x, y) in G'/F' for all y e (R/Z)™. 
Thus we may assume without loss of generality that m — and G/F — (G'/F'). 

We can now view / as a bounded measurable function / : R^'' x (R/Z) — > C 
obeying the periodicity condition 

f{x + n,z + (Ax, n)) = f{x, z) 

for all X e R^'', n e 7?'^ , z G R/Z, as well as the frequency condition 

/(a:,z + 0) -e(C0)/(x,z) 

for some non-zero integer ^. By conjugation symmetry we may take ^ to be positive. 
(The reader may wish to keep the simple case ^ = 1 in mind for a first reading, as 
some of the more technical complications are avoided in that case.) We may thus 
factorise 



(63) 



!{x,z)^F{x)e{iz) 
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where F now obeys the twisted periodicity condition 

F{x + n)^ e{-£,{Ax, n))F{x) 

for all x£li?'^,n£ T?'^ . 

We make the observation that without loss of generality, we may refine the lattice 
r = T?'^ X {0} to any sublattice of F, thus lifting the nilsystem to a finite cover. 
Indeed, the total ergodicity of the system does not change (as can be seen by 
Leibman's criterion [28| for total ergodicity), and the I? and norms of / are 
also seen to be unaffected (renormalising the Haar measure to be a probability 
measure, of course). We shall need to exploit this freedom to refine the lattice 
shortly. 

We now transform the system to a Heisenberg normal form. As observed in |261 
§8] , there exists a 2d x 2d nonsingular matrix $ with rational entries such that 

where J is the 2d x 2d matrix 



As $ is rational and nonsingular, one can find a positive integer q such that 
^{qZ^'^) is a sublattice of F; we may also assume without loss of generality that 
q is even. Passing to this sublattice of F' and applying the change of variables 
{x, z) = ($(a;'), z), we may now assume without loss of generality that A — A* = J, 
at the cost of replacing F with (gZ^'' x {0}). Thus, in particular, we now have 

F{x + n) = e{~S,{Ax, n))F{x) 

for all X e R^'' and n G gZ^'*. 

We can write 



where A' is symmetric. If we then apply the change of variable {x, z) = (x, z + 



fecting the nilmanifold or / (here we use that q is even to show that qZ^"^ ^ {0} 
is unaffected by the change of variables). Thus we may assume without loss of 
generality that 



By lifting the nilsystem to a finite cover via the change of coordinates {x, z) — 
{qx',q'^z') and replacing ^ by g^^, we may assume without loss of generality that 
q — I. Thus, splitting x G R^'^ as (xi, X2) with xi,X2 € R'', the group G now has 
law 

{xi,X2,z){x[,X2,z') = (xi + x[,X2 + x'2, Z + z' + x[x2) 

and F now obeys the twisted periodicity condition 





^{A'x,x)), this has the effect 





F{xi +ni,X2 +n2) = e(-^rii • X2)F{xi,X2) 



LARGE VALUES OF THE GOWERS-HOST-KRA SEMINORMS 



41 



whenever xi,X2 G R"^ and ni,n2 E Z^- In particular, F is periodic in the {0} x Z'' 
directions, and can thus be viewed as a function on R'' x (R'^/Z''). 

A fundamental domain for G/T is given by [0, 1)'^ X [0, l)"* X (R/Z), and the Haar 
measure is given by Lebesgue measure. The L'^{G/T) norm of / can then be 
expressed as 

(64) WfhHG/r) = ( f |F(a;i,X2)P dxidxa) . 

\"'[0,l)''x(R''/Z<i) J 

Now we compute the norm. A calculation shows that HK'^(G') consists of tuples 
of the form 

with the property that the Xi^^ £ R"^ depend linearly on w, thus 

3 

(65) Xi^^ = Xi^Q + ^ Xi^-jUJ-j 

i=i 

for some Xifl,Xij € TV^, and the e R/Z obey the condition 

J2 (-1)1-1^. = 0. 

Indeed, these tuples are easily verified to form a connected Lie group that contains 
all the generators of HK'^(G'), and its tangent space is contained in the tangent 
space of HK^(G), hence the claim. The subgroup HK'^(r) arises when all the Xij 
are integers and the z^^ vanish. A fundamental domain for HK'^(G)/ HK'^(r) can 
then be specified by requiring the Xij to all lie in [0, 1). From this and (1631) . we see 
that 

3 

(66) \\f\\us<G/r)= n C^'"^Fi^l-^.^2.^) T[dXl,^dX2,^ 

J{io,irx{-Rm^r^^^oAP i=o 

where Xi^uj,X2,uj are defined by 

To simplify this expression, we take advantage of the periodicity of F in the {0} x Z"* 
to obtain the Fourier decomposition 

F{xi,X2)^ ^ Fk{xi)e{k ■ X2) 
fcez<* 

where Fk E L°°(R'^) obeys the twisted periodicity condition 

(67) Fk{xi + ni) ^ Fk+^mixi) 

for all xi G R'' and ni G Z*^. By an approximation argument we may assume that 
all but finitely many of the F^. vanish on [0, 1]''. 

We may now expand (|66p as the sum of expressions of the form 

3 
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where (fca;)(^G{o,i}3 are a tuple in Z"*. Because of the constraint (|65|) with i = 2, the 
X2,i integrals vanish unless the k^j take the form 



3 

(69) fccj fco + ^ feji^i 

i=i 

for some fco, fci, ^2, ^3 G Z'^. Assuming that (|69| holds, the expression (|68p simplifies 
to 

3 

The next step is to foliate Z'^ into cosets of f Z'^ (recall that ^ has been normalised 
to be a positive integer). Let Q be the fundamental domain {0, 1, . . . , f — 1}'*. We 
can then split as 

3 

E E / ,3 n ci-iFfc^+5„jxi,.) n^^i.^ 

where 

3 

Using (|67p. this expression can be rewritten as 

3 

E E / ,3 n Cl'^lFfcJa;i,^+n^) 

viewing the xi.i G [0, 1)'' and ni ^ e Z"* variables as the fractional and integer parts 
respectively of a variable yi S R'', we can rewrite this as 

3 

E / ,3 n Cl^lFfcJy^) \{dy,, 

where 

3 

■.= yo + ^yjUjj. 

By the Cauchy-Schwarz-Gowers inequality (see e.g. [531 Lemma 3.9]; the extension 
to R'' is routine), we can upper bound this expression as 

E n ii^fc-iic/-3(R.'^)' 

and then applying Theorem 11.61 we can upper bound this in turn by 

koMMM&Q cje{o,i}3 

From (p7)) we see that the function k ||Ffe||£2(R^d) is periodic with period ^Z'', 
and can thus be viewed as a function g : T/- jtTI^ — > R. We can then rewrite the 
above expression as 

2"^ ^ 5(A:o + wifci + W2A:2 + wsfcs) 

feo,fei,fc2,fc3ez<'/^Z'i we{o,i}3 
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which by ^ is just 

2 ''C^ll5llu3(zd/jzd) 

which by (IT^ is bounded by 

2 '^(^\\9\\L^{Z''/iZ'')- 
Using (1671) and Fubini's theorem, one can rewrite this as 



2 I X! ll-^'=llL2([0,l)'i) 

which by Plancherel's theorem is equal to 

2 '^ll-P'llL2([oa)'ix(R/z)<i)' 



and the claim ([53l) then follows from (|64l) and the fact that d> I. This concludes 
the proof of the first part of Theorem 11.81 

10.1. Sharpness. The above argument also shows that the 2~^/^ constant in The- 
orem [T^ is sharp. Indeed, we consider the Heisenberg nilmanifold G/T in which 
G := X (R/Z) has the group law 

{xi,X2,z){x[,x'2,z') (xi + x'i,X2 + x'2, z + z' + x[x2) 
and r := X {0}, and consider a function / : G/T — > C of the form 
f{xi,X2,z) = e(z) ^ Fk{xi)e{kx2) 

k 

where the : R — > C are functions obeying the periodicity condition 

Fkixi + ni) ^ Fk+m {xi ) 

for all k.n £ Z and xi G R. Thus we can in fact write 

f{xi,X2,z) = e(z)^Fo(xi +k)e{kx2)- 
k 

(The relationship between / and Fq is somewhat similar to that of the Zak transform 
used in signal processing.) The above calculations then show that 

ll/llL2(G/r) = II-FoIIl2(r) 

and 

ll/ll;73(G/r) = ll-Po||;73(R)- 

The optimality of the constant 2~^/^ then follows from the optimality of 2^^/^ in 
Theorem II. 6[ by taking Fq to be a gaussian (which makes / essentially a theta 
function) . 

Remark 10.2. The total ergodicity hypothesis was needed in order to reduce to 
the case when the nilpotent group G was connected. Without this hypothesis, 
additional nilsystems can occur which are not contained in the Abramov factor, for 
which the inequality (j53p is not reducible to Theorem 11.61 thus requiring a further 
analysis. A model example arises by setting G equal to the semidirect product 
Z K R^, where the generator e of Z acts by conjugation on R^ by the formula 

e{x, y)e^^ :— {x, y + x/m) 
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for some integer m > 1. If we then set F := mZ k Z^, and let r e G be the group 
element r := (a, 0)e for some irrational a G R, then G/T becomes an ergodic (but 
not totally ergodic) 2-step nilsystem, with fundamental domain given by {x,y)e^ 
with x,y [0, 1) and j G {0, . . . , m — 1}, and orbit given by 

t"(x, y)e^T = ({x+na}, {y+nx/m+ ""^^ ~ ^K /m- \x+na\ !^-tl)e("+i) -""d "r. 

2 m 

One can verify that the only polynomials of degree at most 2 on this factor are 

actually of degree 1, so the Abramov factoJl is equal to the Kronecker factor (i.e. 

the sets which have trivial behaviour with respect to the y coordinate). From 

Theorem 11.51 one must have a bound of the form ||/||(73(G/r) ^ c||/||2^2((3/p-) for all 

/ orthogonal to the Abramov factor and some absolute constant c < 1, but it is 

not clear to us what this optimal constant is. 

Remark 10.3. The above arguments in fact suggest that there should be multiple 
thresholds; in particular, if X is totally ergodic and ||/||(73(x) > '2~'^^^\\f\\L^{x), 
then / should correlate with a factor generated by the Abramov factor and finitely 
many Heisenberg nilsystems of dimension less than 2c? + 1 . We will not quantify 
this claim precisely here. 

11. Threshold for on an interval 

We now prove Theorem 11.91 We argue by contradiction. If the claim failed, then 
we could find 77 > 0, a sequence N = iV„ of positive integers, and functions / = 

/„ G L°°{[N]) such that |i/|lL~([7V] < 1 and \\f\\umN]) > 2"^/^ + 77, but such that 

(70) (/,e(P))i.([^^])=o(l) 

uniformly for all polynomials P : [N] — > R/Z of degree at most 2, where for the 
purposes of this section, o(l) = o„_i.oo(l) denotes a quantity that goes to zero as 
n ^ 00. We will show that (possibly after passing to a subsequence) one has 

||/||r/3([iv]) <2-i/« + o(l), 

which will give the required contradiction. 

If the N — Nn stay bounded in n, then the condition ([70|) and the Plancherel 
theorem imply that / has an L'^{[N]) norm of o(l), in which case the claim follows; 
thus (after passing to a subsequence if necessary) we may assume that Nn — 00 as 
n — > 00. 

The next step is to apply the arithmetic regularity lemma from [20]. We state a 
form of this lemma suited for our needs: 

Lemma 11.1 (Arithmetic regularity lemma). Let T : R+ R+ be a nondecreas- 
ing Junction with J-{M) > M for all M , let e > 0, let N be an integer, and let 
f G L°°{[N]) with ||/||L°°([Ar]) < 1. Then there exists a quantity M < C^.j^ and a 
decomposition 

f — /nil ^" fsnil funf 

^On the other hand, if one hfts to the finite extension G/(mZ K mZ^), then the Abramov 
factor becomes the entire system. So one way to extend Theorem 11.81 to the non-totally-ergodic 
case is to allow the polynomial P to lie in an extension of X, rather than in X itself. Similar 
objects have also recently been considered by Szegedv[32). 
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into functions /nil, /smi, /unf G L°°{[N]) obeying the following properties: 

• /nil is a {J- {M) ^ N) -irrational virtual nilsequence of degree < 2, complexity 
< M , and scale N . (We will define this term shortly.) 

• ll/sml||L2([jV]) < 

• ll/u„f||c/3([Ar]) < 1/J-(M). 

• ll/nil||L°°([Ar]) < 1- 

Proof. See [2Ql Theorem 1.2]. The theorem there is stated for functions taking 
values in [0, 1], but the extension to complex- valued functions bounded in magnitude 
by 1 is routine. □ 

We now pause to recall some definitions from [50] used in the above lemma; these 
definitions work in arbitrary degree, but we specialise to the degree < 2 case for 
simplicity. We begin with the concept of a filtered degree < 2 nilmanifold, which 
is a slight variant of a 2-step nilmanifold in which the group is required to be 
connected (and simply connected), but the commutator group G2 can be replaced 
with a larger central subgroup of G. 

Definition 11.2 (Filtered nilmanifold). A filtered degree < 2 nilmanifold consists 
of the following data: 

• A connected, simply connected 2-step nilpotent Lie group G — G'(o) — ^(i); 
together with a connected, simply connected central subgroup G[2) ihat con- 
tains the commutator subgroup G2 = [G, G] ; 

• A discrete cocompact subgroup T of G such that ^{2) '■— rnG'(2) is cocompact 
in G(2); 

• A MaVcev basis X for G/T (see [20j for a definition; we will not need to 
know the specific properties of such a basis here). 

We say that a nilmanifold has complexity at most M for some M > 2 if G has 
dimension at most M , and the rationality coefficients of the Mal'cev basis (see \19\ 
Definition 2.4]j is bounded by M. The Mal'cev basis endows G/T with a metric, 
the exact definition of which we omit here. 

A polynomial sequence g : Z — > G (of degree < 2) is a sequence of the form 

g{n) = gogig^^\ where go,gi £ G and 92 G ^(2)- U -^t^ ^ 1; say that g is 
{A, N) -irrational if one has 

l!6(ffi)llR/z < A/N 

and 

||6(.g2)||R/z < A/N 

whenever ^1 : G — > R/Z is a nontrivial continuous homomorphism annihilating T 
of Lipschitz norm at most A (using the metric on G/T), and ^2 ■ G(2) — > R/Z is a 
nontrivial continuous homomorphism annihilating T(^2) of Lipschitz norm at most 
A (using the induced metric on G(2)/T(2)). Here ||a;||R/z denotes the distance from 
X to the nearest integer. 
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The precise definition of complexity is not particularly important for our purposes; 
the only property we need is that for any fixed M, there are only finitely many 
filtered degree < 2 nilmanifolds of complexity at most M , up to isomorphism. 

Definition 11.3 (Virtual nilsequences). Let A,M,N > 2 be integers. An {A,N)- 
irrational virtual nilscquence of degree < 2, complexity < M , and scale N is a 
function / : Z — > C of the form 

f{n) :— F{g{n)T ,n mod q,n/N) 

where 

• G/T is a filtered degree < 2 nilmanifold of complexity at most M ; 

• g : X ^ G is an {A, N) -irrational polynomial sequence; 

• q is a positive integer with q < M ; and 

• F : G /T X Z / qZ X R ^ C is a function of Lipschitz nor at most M . 

For further discussion of these concepts we refer the reader to [20] . 

We now apply Lemma 1 1 1 . 1 1 with e = £„ 1/n (say) and T — Tn — nT^ for some 
sufficiently rapid growth function To to be chosen later. This gives us a quantity 
M = Mn (which will likely grow quite rapidly in n) and a decomposition with the 
stated properties. From ([T2|) we have 

11/ - /nil||c/3([JV]) = o(l), 

SO it will suffice to show that 

||/„ii||c/3([A']) <2-i/« + o(l). 

From ()15p and the Cauchy-Schwarz inequality one has 

(/s„,i,e(P)),(/u„f,e(P)) -o(l) 

uniformly for all quadratic polynomials P, so by the triangle inequality and hy- 
pothesis we also have 

(/„ii,e(F)) =0(1) 
uniformly for all quadratic polynomials. 

By a diagonalisation argument (and choosing sufficiently rapidly growing de- 
pending on M), it thus suffices to establish the following: 

Proposition 11.4. Let M >2 be a fixed integer (independent of n). Suppose that 
N = Nn,A = An are sequences going to infinity, and f — fn ^ L°°{[N]) is a 
{A, N) -irrational virtual nilsequence of degree < 2, complexity < AI, and scale N 
such that 

Uh^am) < 1 

and 

(71) (/,e(F))=o(l) 
uniformly for all quadratic polynomials P. Then 

\\f\\umN])<2-'/' + o{i). 

^To define this precisely, one needs to specify a metric on G/F X Z/qZ X R; the exact choice 
of metric is not important for our arguments, though, and any reasonable construction will suffice 
here. 
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We now prove this proposition. Fix A/. As there are only finitely many isomorphism 
classes of G/T with a fixed complexity bound, we may thus (after passing to a 
subsequence) also fix the nilmanifold G/T. For similar reasons, we may fix the 
period q. 

Write 

f{n) :— F{g{n)T ,n mod q,n/N). 

As / has L°°{[N]) norm at most 1, we may clearly (after truncating F if necessary) 
assume that F is also bounded in magnitude by 1. 

The [/'^([A^]) norm of / can now be computed asymptotically using the arithmetic 
counting lemma from [20, Theorem 1.11], which roughly speaking asserts that the 
triplet {g{n)r, n mod q, n/N) is uniformly distributed in G/T x Z/qZ x [0, 1] for the 
purposes of computing arithmetic averages such as the Gowers uniformity norms: 

Proposition 11.5. One has 

ll/l|L2([Ar]) = \\F\\L2(G/rxZ/qZx[0,l]) + o(l) 

and 

\\f\\m{[N]) = / 

JHK3([0,l])xHK3(Z/<jZ)xHK3(G,G(2))/HK3(r,r(2)) 

Yl Cl"lF(x^,y„,z^) dAiHK3(G,G(2))/HK3(r,r(2))W 
we{o,i}3 

C?MHK3(Z/gZ)(y)c?AiHK3([0,l])(^) + "(l) 

where x = (2^tj)(.c;g{o,i}3 '"^'^ similarly for y,z, HK'^([0,1]) is the restriction of 
HK'^(R) to [0, IJ't"'-'^^ with the normalised Lebesgue measure A'hk3([o,i])7 

HK3(G,G(2)) 

is defined similarly as to HK'^(G) but with G(2) taking the place of the commuta- 
tor group G2, and similarly for HK''(r, r(2)). (L/qTi will be endowed here with 
normalised counting measure.) 



Proof. (Sketch) If F(x,y,z) is independent of the y,z coordinates this follows di- 
rectly from [201 Theorem 1.11] and a routine calculation. The dependence on z 
then be inserted by approximating by a piecewise constant function in z, ap- 
plying [20[ Theorem 1.11] to each piece, and summing to obtain a Riemann sum 
that then converges to the required integral. (Note that F is Lipschitz continuous 
and thus Riemann integrable.) The dependence on y can be inserted by similarly 
decomposing the left-hand side into summations over residue classes modulo q and 
applying the preceding type of computations to each such residue class. □ 

In view of this proposition, it suffices to establish the estimate 

/ 

./HK3([0,l])xHK3(Z/qZ)xHK3(G,G(2))/HK3(r,r(2)) 

rfMHK3(G,G(2))/HK3(r,r(2))(2:)c^AiHK3(Z/9Z)(2/)c^MHK3([0,l])(^) 

< 2"^||F||i2(c/rxz/qZx[o,i]) +o(l)- 
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To show this, we must first convert the information (|7ip into a cancellation property 
of F: 

Proposition 11.6 (Cancellation property). Let G2 '■— [G, G] and T2 '■— G2 H T, 

then G2/Y2 is a torus that acts on G/T, and we have 



(73) / F{g2X,y,z) dfiG2/r2i92) ^ o{l) 

JG2/T2 

uniformly for all {x,y,z) G G/T x Z/gZ x [0,1]. 

Proof. We will use an argument from f5(T, §7]. Write the left-hand side of ([75)1 as 
F{x,y,z), then F is also Lipschitz continuous (uniformly in n) and is also G2/T2- 
invariant. It will thus suffice to show that the quantity 

ll-^llL2(G/rxZ/gZx[0,l]) = (^>^>L2(G/r) 

is 0(1). Applying (a depolarised variant of) Proposition 1 1 1 . 51 we have 

{f,f)L^(lN]) = {F,F)L2{G/rxz/qZxios]) +0(1) 
and so it suffices to show that 

{fJ)L^[N]) = 0(1). 

In view of (j7ip (and the uniform bound on /), it suffices to show that for any 
e > 0, f can be approximated uniformly to error e by a finite linear combination of 
quadratic polynomials e(P), where the size and number of coefficients is bounded 
uniformly in n for fixed e. 

The function F is G2/r2-invariant, and so one can quotient out by this group 
and reduce to the case when G2 is trivial, i.e. G is abelian. In this case, G/T is 
isomorphic to a torus (R/Z)™, and P : Z — >■ R™ is a quadratic polynomial, thus 

/ = F{P{n) mod Z™, n mod q, n/N). 

The claim then follows easily from the Weierstrass approximation theorem. □ 

By modifying F (and thus /) uniformly by o(l), we may now assume that 



(74) / F{g2X,y,z) dnG2/r2i92) = 

JG2/T2 

for all {x, y,z) <E G/TxZ/qZx [0, 1]. It will then suffice to show the inequality ([72)) 
(with no 0(1) error term) whenever _F is a bounded measurable function obeying 

CM- 

By mimicking the proof of Lemma ri0.3[ we see that it then suffices to establish the 
inequality 

/ H Cl"lF(x„)d^HK3(G,G(,))/HK3(r,r(,))(a^) <2-i||P|U2(G/r) 

JHK^G.G^2))/liK^{^,^^2))^^^o^^y, 

whenever F obeys the cancellation property 



(75) / F{g2x) dtiG2/r2i92)=0 

JG2/T2 

for aU X e G/T. 



LARGE VALUES OF THE GOWERS-HOST-KRA SEMINORMS 



49 



Using Fourier decomposition on the torus G{2)/^{2) (which acts on G/T) as in the 
previous section, we may assume without loss of generahty that there is a character 
^ : G(2)/r(2) R/Z such that 

^(.922:) = e{^{g2))F{x) 

for all X £ G/T and 32 G G'(2)/r(2)- If ^ annihilates G2/T2, then from (fM| we see 
that F is trivial, and the claim follows in this case; so we may assume that ^ is 
non-trivial on 6*2 /r2. 

As in the previous section, F is now invariant with respect to the orthogonal com- 
plement := {(72 G G(2)/r(2) : C(ff2) = 0} of We can then quotient out 
by that complement and reduce to the case when G(2)/r(2) = G2/r2 is the unit 
circle R/Z. At this point, the nilmanifold G/T becomes a connected elementary 
nilmanifold in the language of gl] (and HK^(G, G(2))/ HK^(r, r(2)) simplifies to 
HK^(G)/HK^(r)), and the claim now follows from the results of the previous sec- 
tion. 

Remark 11.1. An inspection of the above argument reveals that the condition 
||/||[/3([jv]) > 2^^/^ + 77 in Theorem 11.91 can in fact be relaxed to ||/||(73([Ar]) > 

2-^/'\\f\\m[N])+V- 

12. Threshold for on cyclic groups 

We now use Theorem 11.91 to prove Theorem 11.101 Fix /, N, rj as in that theorem. 

For each integer M > 1, let /(*^) — Z^^^-* : [MN] — !• C be the "unwrapped" version 
of / defined by 

fC^'^^n) f{n mod N) 
for n £ [NM]. A simple calculation shows that 

\\f^''^\\!jmNM]) = \\f\\fmz/NZ)+0{l/M). 

In particular, one has 

ll/^*'^llt/3([iVM]) >2-i/« + 77/2 

for all sufficiently large M. Applying Theorem 11.91 we conclude that for all suf- 
ficiently large M, there exists a polynomial PM{n) — ctMn'^ + I^Ain + 7j\/ with 
dM, f^Mj'^M £ R such that 

\{f^''\eiPM))LHlNM])\>civ) 

where c{ri) > is independent of N and M. We may normalise so that 7m — 
and aM,fiM G [0, 1], thus 

(76) \^n£[NM]f{n mod N)e{-aMn^ - l3Mn)\> c{ri). 

The basic problem here is that aM, Pm are not a priori known to be integer multi- 
ples of 1/iV, so that n ^ aun^ + Pmn does not descend to a polynomial in 7i/N7i. 
To resolve this, we use the Weyl theory for exponential sums. Expressing an ele- 
ment n — [NM] as n — mN + a with m £ [M ] — 1 and a £ [N] , and using the 
triangle inequality, one obtains 

Eae[JV]|IEmg[M]-ie(aM(miV -I- of - l3M{mN + a))\ > c{r]) 
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and thus by Cauchy-Schwarz 

Eae[w]|IE,„G[A/]-ie(aAf(miV + a)2 -/3jv/(m7V + a))p > c{r])^. 

The left-hand side can be expanded as 

^m,m'e[M]-i^ae[N]e{(^M{mN + m'N + 2a)(m - m')N - /3m(™ - m')N). 

By the triangle inequality, we thus have 

IEm,m'e[Af]-i|Eae[JV]e(aAf(2a)(m - m')N)\ > c{irif . 

Thus, for at least c^qY /2A'P values of m, m! E [M] — 1, one has 

Ke[N]eiaMi2a){m ~ m')N)\ > c{rif /2, 

which by the geometric series formula implies that 

\\aMN{2a){m - m')||R/z < C{ri)IN 

for such m,m', and some constant C(ry) > depending only on rj. Applying a 
lemma of Vinogradov (see (THJ Lemma 3.2]), we conclude that for each M and 
a G [N], there exists a rational ha,M/<la,M with |&a,M|, I'Za^fl < C'ivi) such that 

||2aaMA^- ^IIr/z < C\^)INM 

Qa.M 

where C'(rj) depends only on rj. By pigeonholing in the ba^M,<la,M and applying 
the Vinogradov lemma f [191 Lemma 3.2]) again, we can find rationals bM/qM with 
|&M|,|9Af| < C"{7j) such that 

||«M^-— ||r/z<C"('7)/^'M 
qM 

for some Cirf) depending only on -q. By pigeonholing, we may find a rational b/q 
with \q\< C"{t]) such that 

b 

for infinitely many M. We may normalise q to be positive. 

By ([7S|) pigeonholing, for each such (sufficiently large) M there exists an interval 
[mjv/iV, {niM + q)N — 1] such that 

Kie[mMN.XniM+q)N-i]f{n mod N)e{~aMn^ ~ /3Mn)\ > c{ri) 
and thus by translation 

|E„e[gA']/("- mod N)e{-aMn^ - A/"-)| > c{vi) 

for some real /3^,j, which we can normalise to lie between and 1. By passing to a 
subsequence we may assume that converges to a limit /3', and we conclude that 

\&ne[aN]f{n mod N)e{-^n' - P'n)\ > c{r,). 

We can write = ^ + ^ for some integer c and j^j < 1, and so 

|E„e[,w]/(n mod N)e{^P{n))e{-en/ qN)\ > c{ri) 

where P : Z — >■ R/Z is the polynomial P{n) bn^/qN + cn/qN mod 1, which is 
periodic with period qN. 
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By Urysohn's lemma followed by the Weierstrass approximation theorem (and us- 
ing the Arzela-Ascoli theorem to get uniform bounds), given any e > we can 
approximate the function x e{—9x) on [0, 1] to within e in -L^([0, 1]) norm by a 
linear combination of exponentials of the form x i— >■ e{kx) for integer k, with the 
size and mimbcr of such coefficients bounded uniformly in 6. Applying this with 
e = c{r])/2 and using the pigeonhole principle, we see that 

|E„e[9iv]/(n mod N)e{-P{n))e{-kn/qN)\ > c'{r]) 

for some integer k and some c'{ri) > depending only on r], and the claim follows. 
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